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PREFACE 

This  MemoFandum  is  an  interim  report  on  automatic 
ffiachine  analysis  (parsing)  of  English  which  will  eventually 
enable  the  electronic  computer  to  receive  natural  ingllsh 
sentences  and  convert  them  internally  into  formal  structures 
suitable  for  further  machine  manipulation. 

In  this  Memorandum  the  bases  upon  which  a  machine 
Will  accept  and  act  upon  English  sentences  are  given* 
together  with  the  codes  and  rules*  and  the  approach  to  be 
used  in  the  further  development  of  parsing  is  analyzed. 

The  Immediate  purpose  is  to  develop  a  parsing  program 
for  automatic  information  retrieval*  worK  in  information 
retrieval  now  goes  beyond  statistical*  probabilistic 
methods*  and  as  probes  for  relevant  answers  to  requests 
have  been  refined  by  adding  information  about  sentence 
structure,  the  need  for  something  more  than  ad  hoc  or 
"quick  and  dirty"  methods  of  sentence  structure  determination 
is  Increasingly  apparent. 

The  development  of  machine  parsing  has  been  undertaken 
by  The  hand  Corporation  as  a  part  of  its  continuini  research 
effort  in  the  field  of  computer  technology  as  an  adjunct 
to  further  studies  in  information  retrieval.  Command  and 
Control*  etc. 

The  approach  to  automatic  parsing  was  established 
through  discussions  with  eharies  F.  Hockett,  Professor  of 


Linguistics  and  Anthropology >  Gomell  university^  and 
Gonsultant  to  fne  rand  Gorporation>  and  David  G*  Hays  of 
RAND*  fhe  first  program  to  test  the  codes  and  rules  on  a 
digital  computer  was  designed  by  Robert  Dupchaki  Gonsultant 
to  ihe  rand  Gorporationi 
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SUMMARY 

This  M^drandym  presents  a  set  of  gtanunar  oodes  and 
rules  for  analyzing^  or  "parsing,"  English  sentences  autO" 
inatically  on  a  digital  coroputer.  The  introduction  briefly 
discusses  the  nature  of  parsing  and  the  general  inodel  of 
English  structiire  on  which  this  approach  is  based^  and  the 
following  sections  explain  and  illustrate  the  codes  and 
rules  in  detail. 

Although  the  coniputer  prograin  is  not  explained  at 
length,  the  logic  underlying  it  is  illustrated  in  the  dis* 
cussion  of  the  rules.  A  coeaplete  list  of  the  rules  written 
so  far  and  a  sample  output  of  parsed  sentences  are  appended^ 

The  linguist  and  the  progrannier  will  find  the  presenta¬ 
tion  sufficiently  detailed  so  that  they  can  add  to,  refine^ 
or  modify  the  codes  and  rules  and  design  programs  for  apply* 
ing  them  to  text. 
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I.  INTRODUCTION 


PARSING  AS  INTERPRETATION 

Inteipretdtiidn  of  the  expression  these  yellow  clothes 
by  an  English  speaker  Involves  his  experience  with  language 
and  also  with  the  non^lingtiistic  worlds  That  is»  he  uses 
his  past  experience  to  make  distinctions  between  the  colors 
yellow^  red^  and  blue  and  the  objects  clothes^  ragSi  and  so 
forth,  as  well  as  between  words  that  refer  to  themi  so  far 
as  the  words  are  concerned,  machines  can  make  distinctions 
too.  The  word  yellow  is  different  from  the  word  reA  in 
certain  definitely  specifiable  physical  ways  and  it  is  not 
difficult  to  build  reactions  to  Such  differences  into 
machinery,  t'lhat  a  machine  cannot  do,  so  far  at  least,  is 
determine  whether  in  a  given  context  the  expression  is 
appropriate,  whether  or  not  there  are  objects  in  its  exper-^ 
ience  that  are  appropriately  called  clothes  and  appropriately 
called  yellow  and  appropriately  referred  to  by  these  rather 
than  those .  However,  insofar  as  the  meaning  of  an  utterance 
is  a  function  of  the  physical  contrasts  among  the  shapes  of 
sequences  of  words,  a  machine  can  be  instructed  to  differ^ 
entiate  between  these  yellow  clothes,  these  blue  clothes . 
those  red  rags,  and  the  quick  brown  fox  lumped  oyer  the  lazy 
dgg.  In  a  very  superficial  sense  the  machine's  reaction  to 
these  contrasts  can  be  considered  an  interpretation  or  even 
a  translation. 
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Buti  there  is  a  different  kind  of  contrast  in  laniiiage 
which  must  also  he  interpfetedi  In  the  expressionj  "fhese 
yellow  clothes  and  these  whiten  them,''  the  first  three  words 
contrast  in  one  way  with  the  last  three  and  in  a  different 
way  with  the  identically  shaped  sequence  appearing  in,  "I 
like  these  yellow  clothes 4"  To  instruct  a  machine  to  inter* 
pret  the  second  kind  of  difference  is  to  teach  it  to  parse. 
Automatic  digital  computers  are  capable  of  receiving  such 
instruction,  but  first,  grammars  must  be  written  for  themi 

The  English  codes  and  parsing  rules  heing  developed 
at  RAND  are  essentially  a  ffiachine  grainaar.  to  far,  this 
grarmnar  has  enabled  the  computer  to  parse  successfully  a 
variety  of  sentences  which  include  coordinate,  suhordinate, 
relative,  indicative,  and  interrogative  clause  structures, 
it  will  be  further  evaluated  by  having  the  computer  apply 
it  to  the  parsing  of  large  amounts  of  unedited  text.  It  is 
obviously  incomplete  and  inadequate  at  present,  even  more 
so  than  grammars  written  for  people,  but  it  can  be  easily 
expanded  and  revised.  It  represents  a  fruitful  approach  to 
the  problem  because  it  takes  advantage  of  the  computer's 
ability  to  perform  a  large  number  of  simple  operations  with 
great  speed,  it  can  be  written  very  compactly,  and  it  can 
be  progransned  by  exact  or  algorithmic  rather  than  by  emplr<» 
ical  or  heuristic  methods.  Moreover,  it  does  not  force  a 
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single  Interpretation  on  a  sequence  of  words  but  is  designed 
to  preserve  ambiguities  that  actually  exists  so  that  loss  of 
information  in  the  course  of  parsing  is  reduced^ 

A  MODEL  FOR  A  MACHINE  GRAMMAR  OF  ENGLISH 

The  relationships  among  words  in  meaningful  sequences^ 
are  structured^  and  grammars  can  be  viewed  as  models  of  the 
structiufes  in  the  language  for  which  they  are  devised;  or 
at  least  as  based  on  or  providing  insight  into  a  model  under* 
lying  those  structwes.  in  some  languages  the  shapes  of 
the  words  with  their  derivational  and  inflectional  affixes 
play  a  major  part  in  providing  information  for  establishing 
their  relationships  with  each  other.  In  English  and  some 
other  languages,  word  order  is  crucial  though  not  decisive. 

How  one  gets  from  the  linear  or  t^poral  ordering  of  elements 
to  their  structural  relationship  is,  then)  a  major  task  to 
which  a  grammatical  analysis  of  English  sentences  muse 
address  itself.  It  appears  possible  in  English  to  consider 
the  words  in  a  sentence  as  projections  from  the  branchings 
of  a  tree*like  structure.  Above  the  word»level,  the  arrange* 
ment  of  the  branches  appears  to  be  binary  in  general  so  that 

~lt  is  customary  to  regard  the  sentence  as  the  largest 
grammatically  structwed  sequence  in  a  language;  the  relation* 
ships  among  sentences  in  paragraphs  and  larger  csiits  are 
relegated  to  problems  of  style. 
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eadh  node  eends  t«  kave  no€  more  than  Ewo  branches  ^  We  are 
accustomed  to  descriptions  of  sentence  structure  that  are 
basically  binary:  a  sentence  consists  of  subject  and 
predicate;  the  subject  consists  of  a  substantive  and  Its 
modifiers;  the  predicate  consists  of  verb  phrase  and  comple’^ 
ments;  the  verb  phrase  consists  of  verb  and  auxiliaries ^  and 
so  forthi  The  elements  of  each  substructure  are  usually 
adjacent  and  any  departure  from  this  arrangement  In  English 
may  be  very  significant *  as  in  the  inversion  of  subject  and 
auxiliary  to  differentiate  statements  from  questions ^ 

(hie  can  easily  think  of  examples  for  which  this  analysis 
is  unsatisfactory,  but  it  does  provide  a  model  for  powerful 
generalizations  about  English  sentences  and  it  lends  itself 
to  machine  manipulationt  For  these  reasons,  the  parsing 
program  is  based  on  this  "immediate  constituent"  analysis, 

A  computer  is  programmed  to  take  sentences  as  input  and  to 
relate  the  words  in  them  in  paired  substructures  or  "con’* 
stitutes"  until  all  of  the  material  in  a  sentence  has  been 
accounted  for.  The  output  is  a  description  of  the  structv^al 
relationships  of  the  words  in  each  sentence  which  may  sub* 
sequently  be  used  as  input  for  information  retrieval,  machine 
translation,  content  analysis,  linguistic  research,  or  even 
as  machine  instructions. 
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In  prdGe$Sj,  She  computer  is  supplied  with  two  kinds 
of  inforniation:  inforination  about  the  potential  eaGh  word 
has  for  entering  into  graimnatiGal  GOnstruGtions  with  other 
words  (forining  constitutes)  and  inforTnation  about  the 
envirotunents  in  which  these  potentials  are  realized^  The 
first  is  Gontained  in  a  glossary  which  attaches  to  each  word 
or  fortn  ocGurring  in  a  text  a  code  indicating  the  syntactiG 
role  or  roles  it  may  play.  This  is  its  parts-of-speeGh 
Gategoryi  The  second  is  contained  in  a  set  of  parsing  rules 
whichi  theoretically,  lists  all  of  the  grammatically  per¬ 
missible  combinations  of  adjaGent  codes,  and  supplies  a  new 
code  called  a  resultant  for  each  combination.  These  are 
the  two  parts  of  the  machine  grammar  which  will  be 
presented  in  detail. 
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II.  TOE  GRAMMAR  CODES  IN  flj;  GLOSSARY 

THE  PROBLEM  OF  HOMOGRAPHY  AND  AMBIGUITY 

EfigILish  aboufids  in  hdffidgf&phs>  words  whoso  shapes  are 
IdeatiGal  but  whose  meaniugs  and  syntaotic  properties  are 
widely  diflerenti  e.g.,  bear  ("tursus,"  nOtM)  and  bear 
("sustainj"  verb).  Homography  of  verb  and  noun  is  especially 
prevalenti  e.g.^  walk,  run,  set:  and  all  three  of  the  inain 
word  classes“-“verb,  noun,  and  adjective-^exhibit  it  to  a 
degree  likely  to  sturprise  anyone  who  has  not  scanned  word 
lists  with  a  view  to  assigning  parts»of*speech  categories. 
This  hoffiography  accounts  for  the  ambiguity  in  the  sequence 
these  yellow  clothes .  cited  previously.  Often  the  addition 
of  a  single  itsiii  of  context  is  sufficient  to  resolve  the 
ainbiguity  (cf .  i  the  bear  and  to  bear) ;  sometimes  a  complete 
sentence  lends  itself  to  two  or  more  syntactic  tnterpretas 
tlons  even  when  the  total  context  is  considered.  This 
problem  of  ambiguity  in  words  and  constitutes  can  be  met> 
in  part>  by  assigning  multiple  codes  to  each  homograph  or 
ambiguous  sequence;  that  is>  each  possible  classification 
could  be  coded  separately.  Another  way  of  handling  the 
probing  is  to  devise  a  system  of  classification  that  recog^ 
nizes  recurrent  patterns  of  homography  and  ambiguity.  The 
lattet  method  is  the  one  adopted  here.  Words  that  can  occur 
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only  as  verbs  are  coded  with  a  1  in  firsc  poslcipiij  words 
rliat  occur  only  as  nouns  wirh  a  2  In  first  position,  but 
those  that  can  be  either  are  coded  with  a  3,  This  principle 
of  classification  is  followed  when  patterns  of  holography 
and  ambiguity  are  sufficiently  generals  Only  peculiar 
homographs  such  as  well^  miaht.  can^  and  that  are  given 
more  than  one  code.  well,  for  example,  is  given  one  code 
as  verb/noun  and  another  as  an  adverbial  adjective,  since 
this  is  an  unusual  combination  of  syntactic  ranges.  That 
is  assigned  the  same  code  as  this  in  the  determiner  group, 
but  also  receives  a  unique  code  as  a  relative  particle. 

All  forms  of  have  and  the  finite  forms  of  ^  have  separate 
codes  as  verbs  and  as  auxiliaries «  This  method  of  coding 
is  geared  to  a  parsing  logic  developed  by  Joim  Cocke  of  IBM, 
which  preserves  ambiguities  until  they  are  resolved  by  the 
addition  of  larger  context. 

GLOSSARY  CODE  FORMAT 

Codes  have  been  attached  to  words  in  a  glossary  con» 
sisting  of  approxijBately  7000  entries  compiled  from  text. 
Each  word  is  keypimched  in  colvmins  1^30  of  a  standard  IBM 
card.  Coluons  35»41  contain  a  word*numberi  columns  49”52 
contain  the  code.  Forms  which  are  stoply  inflected  variants 
of  a  common  stem  are  assigned  the  same  word*numbcr  in  order 
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£d  fediice  She  duplicattdn  of  semaii^lc  Inforffiaclon  £ha£  isiay 
be  scored  vi&h  Ehein  laser;  however^  peGuliar  homographs  are 
ehsered  Svlcej  wish  separase  word-nymbers  for  each  etisryi 
For  example  she  series  does,  did,  doing >  done  has  one 
word-number  as  forms  of  she  main  verb  and  4g>  does  4  did  has 
anosher  word-nunber  as  forms  of  she  auxiliary.  So  far,  no 
need  has  arisen  for  assigning  more  shan  swo  word-numbers  so 
a  form. 

The  code  assached  so  each  word  or  form  cons is ss  of  a 
ssring  of  four  dig is s.  The  dig is  in  she  firss  p os is ion 
indicases  she  major  parss-of-speech  assignmens,  shose  in 
second  and  shird  posisions  subclass ily  by  coding  she 
Suffixes  of  inflecsed  words  or  she  synsacsic  propersies  of 
uninllecsed  ones.  Funcsion  words  wish  unusual  or  unique 
synsacsic  propersies  and  forms  consaining  numerals  and 
mashemasical  symbols  are  given  special  codes.  Digiss  in  she 
foursh  posision  code  some  of  she  verb  and  noun  subclasses 
of  individual  words  and  she  classes  of  conssisuses  essablished 
during  parsing. 

The  moss  regular  pars  of  She  sysse®  is  the  verb-noun* 
adjecsive-deserminer-aujciliary  secsion  ruiming  from  1000- 
9500.  These  groups  have  morpho- synsacsic  propersies  of 
sense,  nunber,  or  degree,  which  furnish  general  criseria 
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for  classifiGatidn.  In  GOfigrast  with  these,  the  forms  Goded 
with  a  0  in  first  position  are  a  mixed  lot,  mostly  unlnfleGted, 
many  with  unique  functions i  It  has  seemed  hest  at  this  stage 
to  isolate  several  words  with  unusual  properties  and  assign 
them  arbitrary  Godes  at  the  beginning  of  the  Oxxx  series, 
consequently  the  "system"  here  is  a  somewhat  disorderly  com¬ 
promise  between  classification  and  partial  listing.  No 
attempt  was  made  to  provide  codes  for  all  of  the  unlnflected 
particles  but  only  for  those  actually  appearing  in  our 
limited  text 4 

At  the  other  end  of  the  system,  codes  96xx  have  been 
temporarily  reserved  for  numerals,  alphabet  sequences  not 
known  to  be  acronyms,  mixtures  of  numerals  and  letters,  and 
foreign  words ^  It  is  assumed  that  the  numerals  and  alphabet 
sequences  will  function  like  nouns  and  noun  modifiers  for 
the  most  part,  but  until  further  investigation,  we  prefer  to 
keep  them  separate  from  the  main  word  classes  and  particles* 

THE  MAJOR  CLASSES 

Table  1  illustrates  the  major  classes  distinguished  by 
the  first  position  codes  0-9* 


*1D* 


fUble  1 

Major  Par£s-o£>Speech  ClassifleaEion:  Positioft  1  Codes 


Code 

Meanini; 

Examples 

0 

Unique  forms i  imlnflected  parE- 
iGles  (adverbs,  prepositions 
and  conjunGtions) ;  relatives 
and  interrogative  pronotois. 

^  (Note'i  adverbs  formed  by  add¬ 
ing  -Iv  to  an  adjeGtive  base 
are  Goded  as  infleeted 
adjeGtives.) 

very,  and^  why, 
in,  by,  that,  who 

1 

Verb  only . 

see^  civilize 

2 

Notoi  or  pronoun  (excepting  pro¬ 
nouns  that  may  also  be  deter- 

death,  civilization^ 
John,  he,  theirs 

3 

miners,  e»g»,  hisl . 

4 

AdjeGtive  only  (including 
adverbs  with  -ly) » 

uariGe,  beiepnone 

true>  beautiful 

5 

Verb  or  adjective. 

Clean >  clear 

6 

Noim  Or  adjeetive. 

kind.  Corporal 

7 

Verb,  novin#  or  adjective. 

brief,  yellow 

8 

Determiners  (articles,  demon- 
stratives,  quantltatlves, 
possessive  pronouns) . 

a,  the,  this, 
many,  five,  my 

9 

To.  be.  auxiliaries. 

to,  be,  have,  do,  can 

Par  £ Icles  4  Rela  £ ives .  and  Infeerragafclves 

SubGlassificaisidn  and  listing  of  the  words  Goded  0  in 
first  ppsition  are  iilustrated  in  fables  2^  3j  and  4^  (Mly 
three  of  the  fotn:  positions  allotted  in  a  coinplete  code  are 
usedt  so  all  fourth  position  Godes  are  zero^  This  position 
will  be  used  for  further  subclass if ioat ion  after  the  program 

has  been  tested  to  see  what  other  adjustments  are  needed. 

fable  2 

Unique  PartiGles 


Code 

Words 

Comments 

andt  and^’Or 

Coordinating  Gonjunctions  appearing 

0020 

or 

in  the  working  text. 

but 

0040 

not 

fhese  vary  in  syntactiG  function, 

only 

but  all  can  modify  adjectives  and 

0060 

very 

some  types  of  adverbs « 

0070 

as 

0080 

than 

0090 

too 

0100 

somewhat 

0110 

hOw 

These  can  be  both  adverbs,  conjunG» 

0120 

when,  where 

tions,  and  interrogatives.  As  inter*- 

0130 

why 

rogatives,  they  appear  with  Inverted 
order  of  subject  and  auxiliary. 

there 

These  pro-^adverbs  are  separated  from 

here 

the  rest  because  of  their  use  in 
inversions.  "There"  is  a  cosBoon 
inverter. 

0190 

etc. 

Will  require  special  handling  when 
rules  are  written  for  punctuation. 
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fable  2  lists  speGific  words  with  their  arbitrary 
GodieS>  odcupying  a  block  of  numbers  running  from  0000=0190. 
fhey  are  grouped  very  roughly  on  the  basis  of  some  shared 
syntactlG  oharaGteristics,  noted  in  the  coiiments. 

fable  3  illustrates  the  subclasslfieatlon  of  preposl= 
tionSi  adverbs i  and  conjunctions,  fhe  principle  of  GOdlng 
to  Indicate  homography  and  ambiguity  is  applied^  so  that 
a  word  like  after,  which  functions  as  preposition^  adverb ^ 
or  con|unctJon>  is  assigned  to  a  class  whose  code  symbol 
in  second  position  differs  from  symbols  assigned  to  wljhj 
again,  or  which  are  all  members  of  unambiguous  classes. 

Not  counting  the  null  class >  seven  classes  are  theoretically 
possible,  but  the  category  "adverb/conjunction"  has  been 
Omitted  because  the  only  words  belonging  to  it  are  hiaw. 
when .  where .  and  why,  which  have  already  received  arbitrary 
codes . 

fhis  group,  occupying  the  block  of  codes  Ogxx-OTjgc,  is 
further  subclassified  In  third  position  on  the  basis  of  ability 
to  postmodify  a  noun  or  premodify  a  simple,  unaf fixed  verb. 

At  the  level  of  individual  words,  this  subelassifies  only 
adverbs,  which  by  definition  ean  all  modify  verbs.  Words 
that  are  only  prepositions  or  only  eonjunctions  perform  none 
of  these  functions,  although  constitutes  containing  them  can, 
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Table  4 

Relatives  and  ln£erroga lives 


Code 

Word 

Des crip lion 

0800 

lhai 

relaitve  only^  singular /plural  agre^eni 

0810 

whai 

ini erroga live,  delerminer,  singular /plural 
agreement 

0820 

which 

relaiivei  interrogaiive,  deierminer, 
singular /plural  agreemeni 

0880 

who 

relative,  interrogative >  singular/plijral 
agreement,  nominative 

0840 

whose 

relative,  interrogative,  determiner,  sing- 
ular/piural  agreemeni 

0830 

whom 

relative,  interrogative,  singular /pi  wal 
agreement,  accusative 

0860 

how  much 

interrogative,  determiner,  singular  agreement 

0870 

how  many 

interrogative,  determiner,  pl\nral  agreement 

Consequeiigly ,  ehe  glossary  codes  for  adverb  classes  and  snb* 
classes  will  also  be  resulcanf  codes  for  constitutes  con* 
taining  the  other  tiwo  classes. 

fable  4  lists  the  codes  for  a  group  of  complexly  related 
words  which  share  certain  functions  as  relatives >  interroga* 
tives,  and  interrogative  determiners ,  although  only  two  of 
them  combine  all  of  these  ftmctions.  In  this  group,  that 
is  only  relative.  Another  that .  with  a  separate  word  number, 
also  appears  in  the  determiner  group  (Sxxx)  where  its  code 
is  identical  with  the  code  of  this .  since  as  determiners, 
this  and  that  are  syimnetrical  with  these  and  those,  and 
general  rules  can  be  written  for  their  pai-siug.  fable  4 
includes  arbitrary  codes  for  the  constitutes  how  much  and 
how  many . 

the  ContentrWord  Classes 

fable  5  contains  the  codes  for  subclasses  of  the 
Content^word  Classes  with  the  first  position  codes  repeated 
for  easier  reference,  fhe  reader  will  note  that  two  posi* 
tions  are  used  to  code  the  inflectional  suffixes,  fhe  order 
of  positions  in  which  the  suffixes  are  coded  is  conditioned 
by  the  peculiarities  of  Er^lish  homography.  As  noted  pre* 
viously,  English  nouns  and  verbs  are  frequently  homographic. 
The  plural  suffix  of  nouns  is  also  homographic  with  one  of 
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djecElve  Codes 


PoelElon^S^  _  __ 

_ Po8itAon_4 _ 

Meaning: 

Example 

^Gode 

Meaning 

^xample^ 

iNo  suf^ 
fix 

Eook 

6 

Non^deterniined 

N  or  A  ' 

i  death 

:  simple 

;  Verb  ^ 

rakes 

;  1  ; 

Intransitive  v^: 
j  non- dot erm ined 

'N  or  A 

walk 

long 

N 

plural 

boys 

men 

;  2 

IfranSitive  V; 
Non-determined 
|N  Or  A 

brief 

V/H  ^ 

dances 

3 

Two- Object  V; 
NOn-deterniined 

N  or  A 

give 

make 

light 

N  sing^^ 
ular  or 
plural 

fish 

4 

iOetermined  ti 

John ' s 

A  iiZ 

truly 

5 

iCompletely 
i  determined  N 

John 
he,  him 

:  !N  aceui* 
sacive 

him 
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the  verb  suffixes,  So  that  ^iiOfeS i  for  example,  may  be  either 
a  plural  uouu  or  a  present  tense  verb  requiring  a  singular, 
thirdi-person  subjecti  But  homography  is  also  present  vithin 
the  verb  inflections  themselves,  let  is  not  only  ambiguously 
a  nom  or  a  verb,  as  a  verb  it  is  ambiguously  an  infinitive, 
a  present  tense  form  requiring  a  plural  subject,  a  past  tense. 


or  a  past  participle,  compare: 

The  set  of  all  objects.  The  sets  of  all  objects. 
They  set  it.  Me  sets  it. 

and.  They  wanted  to  set  the  They  wanted  to  go  there^ 
tables 


They  set  the  table  every 


They  gg  there  every  day. 


They  set  it  yesterday.  They  went  there  yesterday. 

They  have  set  it.  They  have,  gone  there. 

The  homography  vithin  the  verb  form  is  coded  in  second 
position;  the  remaining  homography  of  verb  with  noun  is 
then  coded  in  third  position. 

Verbs  and  nouns  are  subclassified  in  third  position 
according  to  the  presence  or  absence  of  the  ^  suffix.  A 
zero  means  that  it  is  absent,  a  1  that  it  is  present  in  an 
unambiguous  verb  (lOix) ,  a  2  that  it  is  present  in  an  unam« 
biguous  noun  (2x2x),  a  3  that  it  is  present  but  the  stem 
is  ambiguously  noun  or  verb  (3x3x) ;  a  4  that  it  is  absent 


but  that  as  a  nouiii  tha  form  Is  ambiguously  singular  or 
plural i  The  Gode  symbol  6  is  assigned  to  accusative  forms 
of  nouns  (pronouns) ,  regardless  of  whether  or  not  they  are 
singular  or  plural^  since  they  do  not  appear  as  subjects 
nor  take  determiners,  and  therefore  do  not  require  number 
agreement  so  far  as  parsi^  is  concernedi 

Verbs  are  subclassified  in  fourth  position  according 
to  their  ability  to  take  one  or  more  substantive  objects. 

An  intransitive  verb  has  a  1  in  fourth  position  and  the 
parsing  rules  do  not  provide  for  its  being  parsed  with  a 
following  Substantive  expression.  A  2  in  this  position 
indicates  that  the  verb  is  transitive,  a  3  that  it  can  take 
both  an  indirect  and  a  direct  object.  How  these  codes  are 
modified  during  parsing  is  discussed  in  the  section  on 
Resultant  Codes  (Sec.  tv) .  Here  it  is  sufficient  to  note 
tha^  all  potential  verbs  will  have  a  1,  2,  or  3  in  fourth 
position t 

Nouns  are  subclassified  in  fourth  position  according 
to  their  ability  to  pick  up  determiners.  Boys .  for  example, 
may  form  a  constitute  with  the  or  all  or  both  together,  while 
John ' s  may  form  a  constitute  with  all,  but  not  with  the,  and 
John  and  he  never  take  determiners.  The  code  for  boys  is 
2020,  for  John ' s  2904,  and  for  Jotm  and  he  2005.  Unambiguous 


-19- 

ridims  will  have  a  0,  4,  or  5  in  fourth  posieloOi  Words 
in  the  amhiguous  V/N  or  V/N/A  Glasses  will  have  a  Ij  2j  or 
3  in  fourth  position,  depending  on  the  suhclass  of  the  verb, 
but  as  nouns  they  are  capable  of  acquiring  deteminers  and 
the  codes  1,  2,  3  are  equivalent  to  0  for  the  noun  fimctionSi 
They  are  also  equivalent  to  0  for  the  adjective  functions 
in  the  V/A  and  v/n/a  classes^ 

If  sonie  ineinbers  of  the  V/n  and  V/N/A  classes  were 
limited,  like  John's  and  in  their  ability  to  take  two 
or  more  determiners,  the  syst^  would  be  unworkable  because 
it  would  be  impossible  to  code  for  verb  transitivity  and 
determiner  limitation  simultaneously,  but  since  only  genitive 
nouns,  proper  noims,  and  prono^s  have  the  determiner  limita¬ 
tion  and  since  they  are  always  unambiguously  nouns,  no  con¬ 
flict  arises.  No  word  coded  with  a  l,  3,  or  7  in  first 
position  has  a  0,  4,  or  S  in  fourth  position,  with  two 
exceptions:  ves  and  no.  which  may  be  either  noims  or  sentences* 
are  coded  3QOO. 

To  Illustrate,  the  complete  coda  for  set  is  3603.  The 
first  3  indicates  that  it  belongs  to  the  class  V/N;  the  6 
that  it  may  be  infinitive,  present  tense,  past  tense,  or 
past  participle;  the  0  that  as  a  present  tense  verb  it 
requires  a  plural  subject  and  as  a  noim  it  is  singular; 
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the  3  in  fourth  position  that  as  a  verb  it  may  take  two 
objeGfSi 

k^enever  the  addition  of  an  infleGtional  suffix 
resolves  an  ainbiguity  existing  in  the  stein  forfn  of  a  word^ 
the  Suffixed  forffi  is  eoded  as  unambiguously  as  possiblei 
SetSi  for  exainple,  is  coded  3033  rather  than  3633,  since 
the  ^  suffix  resolves  the  ambiguity  of  the  verb  inflection, 
but  not  the  overall  V/N  ambiguity. 

Some  suffixes  create  ambiguity.  All  present  parti¬ 
ciples  of  verbs  can  function  as  nouns  and  some  can  also 
function  as  adjectives.  AGcordingly,  setting  is  coded 
3003  and  exciting  is  7002*  cf . ,  "the  excitint  of  the  atom»" 
"it  is  exciting  the  atom,"  "it  is  very  exciting > "  On  the 
Other  hand,  settings  and  findings  are  coded  2020  as  vmam- 
blguous  plural  nouns  capable  of  accepting  determiners. 

Many  past  participles  of  transitive  verbs  may  prempdify 
noimSi  but  they  are  not  coded  as  potential  adjectives 
unless  they  also  accept  the  typical  adjective  modifiers 
more,  most,  very .  and  the  inflection  - 1 v .  Broken,  coded 
5302,  belongs  to  the  latter  class;  consequently  it  was 
broken  can  be  interpreted  either  as  a  passive  or  as  a 
copulative  construction,  whereas  it  was  taken  cannot. 

Some  words  like  xmder developed  or  outspoken,  although 


-21^ 

ffiorphsldgleally  tnarked  as  If  ehey  were  pasE  participles ^ 
are  coded  as  adjectives  onlyj  since  there  are  no  verbs  to 
underdevelop  (S£  to  pjut Speak,  and  It  was  underdeveloped  and 
he  was  outspoken  are  clearly  copulative  rather  than  passivei 
The  second  position  codes  for  the  coinparatlve  and  super* 
lative  forms  of  adjectives  need  little  explanation.  The 
*er  suffix  is  a  homograph  and  some  words  ending  in  it  may 
belong  to  the  N/A  class.  An  agentive  noun  like  farmer  is 
coded  2000^  but  cleaner  1$  coded  6700  as  an  ambiguous  N/A* 
comparative.  The  superlative  suffix  is  not  homographie  and 
its  addition  to  an  ambiguous  stem  resolves  the  ambiguity. 
Cleanest .  clearest .  and  finest  are  all  coded  4700*  although 
clean  and  clear  are  5002  and  fine  is  7003. 

The  "Iv  suffix,  regarded  as  an  adverbial  inflection  of 
adjectives,  also  resolves  ambiguity  and  appears  only  witdi 
words  having  a  4  in  first  position.  Words  like  deadly . 
kindly .  and  silly  are  coded  as  uninflected  adjectives. 

Using  the  code  symbol  5  in  third  position  for  the  ’’Iv 
suffix,  thus  interrupting  the  nvtnerical  sequence  of  codes 
for  noun  subclasses  in  that  position,  is  inadvertent  and 
inelegant,  but  does  no  harm.  It  may  also  seem  odd  that 
the  suffix  was  coded  in  this  position  rather  than  in  second 
position.  The  reason  for  doing  so  is  that  only  one  digit 
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(9)  vas  still  available  in  second  position  and  it  was  needed 
for  Goding  the  genitive  nouns ,  Coding  the  genitives  in  sec* 
ond  position  avoids  inter fer'ence  with  the  singular /plural 
coding  in  third  position.  It  would  be  possible  and  perhaps 
preferable  to  fnake  two  sj^bols  available  in  third  position 
for  genitives i  one  for  the  singular  and  one  for  the  plural ^ 
and  put  the  adjective  suffix  in  second  position.  This  siay 
be  done  in  a  later  revision. 

All  genitive  nouns  are  affibiguously  singular  or  plural 
with  respect  to  subject-verb  agreeffient,  but  not  with  respect 
to  de terminer -noun  agreement;  cf , ,  ''That  man's  accounts  are 
false  but  this  man's  are  true*"  Since  genitives  are  unam¬ 
biguously  nouns i  the  code  symbol  9  in  second  position  will 
always  be  preceded  by  the  code  symbol  2  for  words  in  this 
set, 

The  symbol  0  in  third  position  codes  words  with  no 
suffixes  or  with  suffixes  already  coded  in  second  position, 
The  coding  of  the  V/N  suffix  ^  in  third  position  has  been 
explained  earlier,  and  the  use  of  the  other  symbols  should 
be  clear  from  the  examples  in  Table  5. 

The  Determiners 

The  determiners  are  primarily  subclassified  acGording 
to  two  intersecting  criteria:  their  ability  to  function  as 
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subs  t-ant*  Ives  atld  shelr  raqulrarnents  for  slngular-pltJafal 
agreenient.  these  crlceria  establish  four  of  the  five  Sub- 
Glasses  coded  In  second  position.  ^  A  single  deterinineri 
all 4  is  arbitrarily  assigned  to  a  fifth  subclass ^  because 
It  Is  the  typical  outenaost  deterininer  in  a  noun  phrase  and 
isolating  it  simplifies  writing  the  parsing  rules 4  the 
classification  is  designed  primarily  to  handle  the  deter¬ 
miners  actually  occurring  in  our  rather  limited  glossary^ 
but  some  additions  have  been  anticipated  in  designing  it 4 
Both,  for  instance,  which  does  not  appear  in  our  text, 
should  probably  be  subclassified  with  all  and  then  ftjrther 
different iatedi 

third  position  codes  for  subclassifying  this  group  are 
designed  differently  from  those  for  other  classes  and  sub* 
classes  discussed  so  far.  they  are  arranged  in  blocks  SO 
that  as  many  as  three  different  symbols  may  code  the  same 
syntactic  functioni  thus  permitting  the  assignment  of 
unique  codes  to  most  of  the  determiners,  and  the  writing  of 
special  parsing  rules  for  their  combinations  without  barring 
further  writing  of  more  general  rules  as  well  whenever 
possible.  These  blocks  of  digits  subclassify  the  determiners 
aecording  to  their  order  classes,  but  only  very  roughly, 

^One  theoretically  possible  subclass  twns  out  to  be 
^pty,  namely  the  "never  substantive;  plural  agreement"  subclass. 
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since  the  order  eless  petterning  of  these  words  appears  to 
be  too  intriGate  for  the  construction  of  any  siinple  or 
elegant:  schenie*  Codes  0*  1*  and  2  are  assigned  deteminers 
which  cannot  follow  any  others ^  Codes  3^  4^  and  5  are  assigned 
to  those  which  can  follow  thiSi  that,  those,  his,  her. 
etCij  hereafter  called  the  th-  group  *  but  which  Gannot  fol¬ 
low  all.  Code  6  is  assigned  to  nufuerical  detertniners  chree. 
four .  etc.j  (but  not  one  or  two) .  whieh  can  follow  both  all 
and  the  th-  group.  Codes  7  and  8  are  assigned  to  those  which 
can  follow  all  but  not  the  th-  group.  Code  9  is  assigned  to 
Such,  which  can  follow  any  of  the  other  detezininers  except 
a  and  the  tht  group, 

table  6  contains  the  codes  and  their  ttieanings  for 
second  and  third  positions  in  the  detertsiner  group.  Table 
7  shows  all  of  the  detersniners  occurring  in  the  glossary 
together  with  their  codes,  so  that  the  reader  can  interpret 
the  parsing  rules  for  their  individual  combinations. 

Later,  other  determiners  can  be  inserted  as  the  glossary 
is  expanded  and  finer  distinctions  made,  by  using  the  fourth 
position  for  subclassification.  Meanwhile,  this  design 
allows  one  to  write  parsing  rules  for  determiner  combinations 
such  as  many  m^ore.  this  much,  some  three  ("approximately 
three"),  which  are  always  constitutes,  as  well  as  for 
anomalous  combinations  in  which  requirements  for  singular/ 
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Table  6 

Decernilnej^  Cedes 
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plural  agreement  are  altered^  as  in  a„ little 4  a  few,  many  a . 
and  such  a. 

To.  ie.  Auxiliaries,  and  Contraeped^i^orms 

Suhclassification  of  this  group  is  illustrated  in 
fable  where  all  of  the  words  belonging  to  the  group  are 
provided  codes  whether  or  not  they  appear  in  ow  working 
glossary . 

The  infinitive  marker  to  is  assigned  the  unique  code 
9000.  Since  it  is  also  a  preposition/adverb,  its  code 
appears  in  the  parsing  rules  for  combining  prepositions 
with  objects,  and  verbs  with  adverbs.  Aithough  its  range 
of  syntactic  functions  is  peculiar,  there  saems  to  be  no 
need  at  this  stage  for  assigning  it  two  different  word- 
nxznbers  and  two  codes  4 

Like  the  intransitive  verbs  in  the  main  word- classes, 
be  and  the  auxiliaries  all  contain  a  1  in  fourth  position, 
which  becomes  a  0  in  the  resultant  code  when  they  are  parsed 
with  potential  preceding  subjects.  An  indicative  sentence 
with  a  form  of  ^  as  its  main  verb  will  be  coded  91^0  (with 
the  third  position  containing  a  0,  1,  or  2).  This  means 
that  copulative  sentences  are  easily  distinguishable  from 
those  containing  other  verbs.  Likewise,  elliptical  or 
anaphoric  sentences  with  forms  of  have,  do  or  modals  are 
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idenCiflable  In  the  resultant  cddes^  all  of  which  will  cCn* 
tain  a  9  as  first  position  code  and  0  as  fot^th  position^ 

With  the  second  position  distinguishing  the  partiGular  type.^ 

In  third  position^  codes  0*  1*  and  2  are  assigned  on 
the  basis  of  requirements  for  number  agreement  with  subjects. 

Be  is  the  only  verb  requiring  this  agrement  in  past  tense 
as  well  as  present  tense  forms ^  Gonsequently  the  code  9121 
is  not  assigned  to  any  of  its  single  forms.  Modals,  on  the 
other  handj  never  require  nt®ber  agreement  and  are  all  coded 
9421.  tlien  a  modal »  e.g.,  can 4  is  parsed  with  the  infinitive 
be.  requirements  for  number  agreement  vanish  and  the  code 
9121  is  assigned  to  the  constitute.  (Cf.,  he  was,  they  were 4 
he  can  be.  they  can  Jbe.) 

Be  differs  from  all  other  verbs  also  in  having  different 
forms  for  the  infinitive  and  for  the  firSt-perspn  singular, 
present  tense.  Codes  for  these  forms  are  assigned  arbitrar* 
ily  in  the  third  position. 

Third  position  codes  are  also  arbitrarily  assigned  to 
contracted  forms  of  8ubject*^plus^auxillary  and  to  let '  s .  Con# 
tracted  forms  with  n't  are  not  coded  separately,  but  are 
assigned  the  same  word  nunbers  and  codes  as  the  positive  forms* 

"See  Sec,  ly.  Resultant  Codes* 


Odd  Foiias  i  Jliaaerl^^^  Votds 

The  r^ainlng  £d£lns  in  £he  ffdin  which  the  wdr’king 

glossary  comes ^  are  not  English  words i  They  are  numerals  or 
alphahet  sequences,  not  known  to  be  acronyms,  or  foreign  words 
like  hoc,  et 4  bleu.  Hyphenated  sequences  of  numerals 
which  could  be  dates  have  been  coded  as  nouns i  The  rest  are 
assigned  the  first  two  position  codes  96  and  subclassified 
in  third  position.  Their  complete  codes  atei 


Alphabetic  only  9600 

Mixed  ntmierals  and 
alphabet  9610 

Arabic  niifierais  9620 

Roman  numerals  9630 

Latin  9640 

French  9650 


Only  three  rules  have  been  written  for  parsing  them.  One 
rule  allows  some  of  them  to  be  parsed  with  nouns  as  if  they 
were  adjectives,  and  two  rules  provide  for  coordinating  them 
with  and.  Such  forms  will  have  to  be  dealt  with  sooner  or 
later»®their  frequeney  is  higher  than  one  might  assume  with-* 
out  checking*«but  other  problems  have  priority  at  this  point. 
So  far,  the  coding  systan  has  been  presented  prunarily 
as  it  is  used  to  code  individual  words  and  forms,  although 
several  times  the  extended  application  of  word  codes  to 
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resultant  cddas  fo^*  GdnstituSes  functioning  like  a  single 
vord  has  been  touGhed  on^  A  fuller  discussion  of  the  result¬ 
ant  codes i  both  those  that  correspond  to  word  codes  and  those 
that  appear  only  in  the  course  of  parsing,  will  be  deferred 
until  the  parsing  rules  have  been  explainedi 
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III.  Tiffi  PARSING  RULES 

If  we  £eke  the  word  as  our  snallest  unit  and  the  sentence 
as  our  largest^  we  can  describe  parsing  as  essentially  the 
combining  Of  words  in  a  text  into  successively  larger  con¬ 
stitutes  until  all  of  the  words  between  two  sentence-end 
markers  have  been  accounted  for.  the  codes  attached  to  the 
words  indicate  what  combinations  they  may  enter;  parsing 
rules  provide  the  entry. 

METHOD  OF  APPLICATION 

Over  500  rules  appear  in  Appendix  B  in  a  highly  con¬ 
densed  form  capable  of  expansipn  in  the  computer  into  an 
estimated  2,SO0«  These  rules  simply  state  what  adjacent 
grammar  codes  can  be  combined  and  what  resultant  codes  their 
combination  will  produce*  in  the  general  form; 
xxxx  +  yyyy— •>zzzz.  The  program  will  direct  the  computer 
to  examine  all  adjacent  wor<^  in  the  input  sentence*  compare 
their  original  codes  (0^  and  with  the  pairs  combined 

by  the  rules*  apply  all  rules  with  matching  pairs*  and  carry 
the  resultant  codes  (R^)  in  the  continuing  cycle,  As  the 
cycle  continues*  the  pairs  ^  ^  ®n+2* 

+  ^x4-l  also  be  checked  against  the  rules, 

For  eximple*  assume  the  it^ut  sentence  is*  '*The  men 
went  there,''  The  originel  granipar  codes  for  each  word  will 
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be  assigned  in  a  glossary  look*up  subroutine^  The  codes, 
in  order ,  are : 

The  (Oj^)  men  (O^)  went  (O^)  there  (0^)» 
8470  2020  1101  0150 

Comparing  the  adjacent  codes  with  those  in  the  rules,  the 
computer  assigns  resultants  as  follows; 

1*  Oj^  +  O2  2  the/men 

8470  2020  2024  determined  noun 

2.  O2  +  0^  ***>  R2  '  men/went 

2020  1101  1000  sentence 

3i  0^  +  0^  Rg  *  went /there 

1101  0150  1101  past  tense,  finite, 

intransittve 

4,  Oj^  R2-4^no  rule  :  the /men  went 

8470  iOOO  (not  permitted) 

5,  +  Oj  5  the  men/went 

2024  1101  1000  sentence 

6,  R2  +  0^  =*>  R5  I  men  went/there 

1000  0150  1000  sentence 


7 .  Rj^  +  Rg 

2024  1101  1000 


the  men/went  there 


8.  +  Rgm*>no  rule  :  *  the/men  went  there 

8470  1000  (not  permitted) 


9.  R4  O^-ti-^eo  rule  :  *  the  men  went /there 

1000  0150 


(not  permitted) 
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Ilie  Goie  lOOOy  whlGh  first  appears  In  s£ep  2,  deslgnaeeS 
a  cdnSEltute  that  is  potentially  a  sentence  or  Independent 
clause  and  Is  the  resultant  code  whenever  a  finite  verh  or 
verb  phrase,  not  requiring  nusaber  a^reeinent,  is  parsed  with 
a  possible  subject  preceding  it*  The  appearance  of  this  code 
In  steps  2,  and  6  is  preinature  because,  although  the 
resulting  constitutes  can  be  sentences,  not  all  of  the  words 
In  the  string  being  parsed  have  been  accounted  for.  Only 
Step  7  accounts  for  all  of  the  words  in  the  string,  and  it 
alone  will  siurvlve  as  the  final  resultants  the  premature 
parsings  will  die  in  the  cycle.  The  survivor  will  appear 
in  an  output  consisting  of  steps  1,  3,  and  7,  corresponding 
to  the  conipleted  parsing  path. 

A  sample  output  for  several  test  sentences  is  shown 
in  Appendix  C. 

RULE  SYMBOLS 

The  rules  in  the  example  are  merely  schematic,  The  actual 
parsing  rules  are  written  in  a  much  more  generalized  and  com¬ 
pact  form  for  machine  interpretation  and  manipulation,  the 
compactness  being  achieved  by  using  alphabetic  cover  symbols 
for  various  numerical  code  patterns,  by  establishing  instruc¬ 
tion  symbols,  and  by  writing  rules  as  arrays,  when  more  than 
one  pair  of  codes  produce  the  ssune  resultants. 


Cover  „SvfflkQ  1  s 

Svipposei  for  exantplej  we  wish  go  wrlge  a  rule  sgaglng 
chag  a  pogengial  adjeGgive  plus  a  pogenrial  uouu  may  be  pug 
gogegher  as  a  nominal  GOnstiguge;  A  +  N^>  The  number 
of  different  Godes  for  words  ghac  GOuld  pargiGipage  in  ghis 
GonsgruGgion  is  exgrernely  greag;  ig  will  inGlude  nog  only  ghe 
unambiguous  Godes  for  words  like  happy  and  meni  bug  ghe  ambig‘^ 
uous  ones  for  words  like  man 4  deer .  brief  4  seg4  abaMaGfi, 
cleaner  4  and  exciging;  as  welli  in  ghe  sysgem  ghere  are 
about  sixty  differeng  codes  for  words  ghat  could  be  adjeGgives 
and  aboug  fifgy  for  words  ghag  could  be  nouns*  If  every 
possible  pair  of  Godes  were  reGorded  separagely,  3000  entries 
in  ghe  parsing  rule  for  pugging  pogengial  adjecgives  and 
nouns  together  would  have  to  be  rewritten. 

Instead,  cover  symbols  have  been  defined  for  each  of 
the  four  code  positions.  Thus  an  A  in  first  position  covers 
the  adjecglve  code  numbers  4,  5,  6,  and  an  A  in  second 
position  covers  0,  2,  3,  4,  5,  6,  7,  and  8;  an  S  in  third 
position  covers  0  and  4;  and  an  S  in  fourth  position  covers 
0,  1,  2,  and  3  (indicating  a  simple  unmodified  adjective  in 
a  preceding  A  context),  Similarly  for  notms,  N  in  first 
position  covers  the  noun  code  numbers  2,  3,  6,  and  7,  and 

~See  Table  1, 


80  for Oh  for  Che  other  positions.^  Nov,  the  rule  can  be 
written: 

AASS  +  NNNS  ResultaM  nominal  constitute. 

InatructiOn  Symbols 

The  resultant  code  for  the  pair  in  the  above  eumple 
has  not  been  specified  because  its  third  position  code 
depends  on  whether  the  original  novm  constituent  was  singular 
or  pltutal  or  potentially  either.  Since  similar  situations 
in  writing  other  rules  will  be  encountered,  another  kind  of 
symbol,  an  instruction  spobol,  must  be  defined,  which,  in 
effect,  orders  the  computer  to  copy  or  modify,  in  the  result- 
ant  code,  all  or  part  of  one  of  the  constituent  codes  whenever 
the  parsing  rule  is  applied.  Four  sueh  symbols  have  been 
defined. 

B,  in  any  position  of  the  resultant  code  in  a  rule, 
means,  "Copy  in  the  resultant  code  for  this  pair,  the  code 
symbol  in  the  corresponding  position  of  the  first  constituent' 
code."  C  means,  "Copy  the  code  symbol  in  the  corresponding 
position  of  the  second  constituent's  code."  The  rule  for 
conbining  adjective  and  noun  now  appears  in  the  form: 

AASS  +  NNMS-*<>  20C0. 

^See  Appendix  A. 
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A  ifelAted  Instifyctldn  symbol  Is  D,  whleh  directs  the 

coisputer  to  subtfeet  1  froo  whatever  eode  digit  appears  In 

the  Gorrespondlhg  position  of  the  first  oonstituent»  Its 

inain  fwiction  is  to  allow  one  to  write  resultant  codes  for 

verbs  Or  verb  phrases  plus  their  objects,  setting  the 

transitivity  of  the  resulting  eonstitute  at  one  less  than 

that  of  the  original  verb  constituent <  Ihus,  a  verb  that 

may  take  either  one  or  two  noun  objects,  represented  with 

cover  symbols  by  VVV2,  will  be  parsed  with  a  following  noun 

3 

to  produce  a  resultant  in  which  the  original  2  or  3  will  be 
reduced  by  1  and  copied  in  the  fourth  position. 

Presumably  we  could  wite  the  rule  as: 

VVV2+  NiiNN— »  IBBD. 

3  6 

9N 
6 

the  addition  of  the  9  and  6  would  be  necessary  since  the 
symbol  N  in  these  positions  does  not  cover  the  genitive  and 
accusative  noxsi  codes,  this  is  only  a  minor  problem,  as 
will  become  evident  in  a  moment.  More  serious  is  the  fact 
that  use  of  the  instruction  symbol  B  in  the  resultant  may 
lead  to  forbidden  sequences.  If  the  original  verb  was  a 
V/li  303*  or  304x,  then  a  103x  or  104x  would  appear  in  the 
resultantf  Such  sequences  should  be  avoided  because  they 
are  inconsistent  with  the  vnambiguous  1  coding  of  the  first 
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position.  This  can  be  doae  by  splitcing  the  rule  Into  two 

rules  of  the  following  forms: 

VVS2  +  NNNN-^  iBODj  where  S=  0*  4^ 

3  6 

m 

6 

and  VV12  +  lOlD. 

3  6 

32  9N 

3  6 

In  these  foms  of  the  rules,  the  ambiguities  of  the  original 
codes  are  reduGed  as  much  as  possible  in  the  resultant. 

These  two  rules  may  be  sirnpllfied  still  further  by 
using  the  fourth  instruction  symbol^  an  asterisk^  which 

means,  "Any  code  symbol  is  acceptable  in  this  position"; 
in  effect,  an  instruction  to  "skip  it,"  in  these  two  rules, 

it  does  not  matter  whether  or  not  the  verb  is  finite,  or 
present  or  past  participle;  all  that  matters  is  that  it  is 
a  verb  and  that  it  is  transitive.  Also,  the  follpwlng  noun 
may  be  singular  or  plwal,  genitive  or  accusative.  In  other 
words,  while  the  third  position  code  of  the  potential  verb 
must  be  specified  in  order  to  write  the  resultant,  only  the 
first  and  fourth  positions  are  relevant  for  determining 
whether  or  not  a  given  pair  can  be  put  together  a$  verb  plus 

^See  Appendix  A. 


object: i  Inereforei  we  can  wrlce: 


V*S2 

3 

N**N 

IBOD 

and  V*12 

lOlD 

32 

3 

tMlike  cbe  other  Instruction  s^bols  C|  and  which 
appear  only  in  resultants^  the  asterisk  appears  only  in  the 
constituent  codes  and  never  in  the  resultants.^ 

RULE  FORMAT 

The  rules  are  identified  with  an  alphabetic  symbol 
followed  by  a  number.  In  Ict^unehing,  the  fields  used  are: 


Columns 

Contents 

1-2 

Deck  identification 

5^8 

Rule  identification 

15*18 

Code  of  first  constituent 

25*28 

Code  of  second  constituent 

35*38 

Code  of  resultant 

In  order  to  interpret  more  fully  the  format  in  which 
the  rules  are  written,  let  us  return  to  our  pair  of  verb^object 

^For  a  fuller  understanding  of  how  the  instruction  and 
cover  symbols  are  used,  the  r^der  might  try  working  his 
way  through  a  few  of  the  rules  in  Appendix  B,  after  reading 
the  next  section  in  which  the  format  is  explainedt  and  the 
key  to  Rule  Symbols  in  Appendix  A. 


rules  ^  nor  lug  rhar  In  the  verb  consrituents  $  ^e  code  posl*^ 

rlons  were  occupied  In  a  second  row.  Thus,  in  the  second 

rule  of  the  pair,  V*12  was  written.  This  Is  to  be  Inter^^ 

3 

32 

3 

preted:  "if  the  first  position  code  Is  a  V,  the  second 
position  Is  not  significant  for  application  of  this  rule, 
the  third  position  may  be  a  1  or  a  3,  and  the  fourth  posl* 
tion  may  be  a  2  Or  a  3."  In  general,  whenever  a  code  field 
for  a  constituent  Is  not  filled  out  to  all  four  positions, 
each  blank  Is  interpreted  as  being  filled  by  the  symbol  in 
the  colvsnn  above  it  and  all  combinations  of  four  code 
position  sytnbols  are  permitted.  However,  when  all  four 
positions  are  filled  in  both  rows,  as  in  the  inters 

pretatlon  is  aaaa  or  bbbb. 

We  also  Interpret  the  connections  across  code  fields 

^  ^  bbbb  ^  dddd 

as  and/or  connections ,  SO  that  aaaa  +  means: 

»  Gccc  eeee 

aaaa  and  bbbb  or  aaaa  and  cccc  go  to  dddd  eeee.  These 

conventions  permit  us  to  write  rules  as  condensed  arrays 

whenever  more  than  one  subclass  or  class  of  GOnstltuents 

can  ente^  into  a  constitute,  or  whenever  more  than  one  sub» 

class  or  class  of  constitutes  are  produced  by  the  combination. 

For  example,  the  subclasses  of  transitive  verbs  may  be 
parsed  with  some  subclasses  of  determiners  as  well  as  with 


np^s  fo  prPdtiee  result:an£  verb- ob jest:  cbsieritiutes^  the 
codes  for  such  dererifiiners  may  be  added  Co  previous  rules « 
Accordifiglyj  Che  second  Of  Che  pair  of  verb-objecc  rules  ^ 
as  Ic  accually  appears  in  che  appendix ^  reads: 

R26  V*12  N**N  lOlD 

3  8W*0 

32 
3 

(w  is  che  cover  symbol  for  che  second  posicion  codes  of  all 
pocencially  subscancive  decermlners) ^  If  che  code  field 
combinacions  were  wriccen  ouc^  leaving  only  che  cover  and 
inscruccion  symbols  incacc^  che  rule  would  appear  as 


V*12 

N**N 

lOlD 

V*13 

N**N 

lOlD 

V*32 

N**N 

ioii) 

V*33 

N**N 

ioiD 

V*12 

Sw*o 

ioiD 

v*i3 

8w*o 

lOlD 

v*3t 

8W*0 

lOlD 

V*33 

8W*0 

IOID 

WhaC  ics  appearance  would  be  wich  che  cover  and  inscruccion 
symbols  caken  away*  shall  be  lefc  co  che  digical  compucer. 
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IV.  RESULTANT  CODES 


TYPES 

Fdf  pt^pdses  of  diSGussldaj  the  tesultant  codes  are 
divided  into  three  types  oii  the  basis  of  their  res^blance 
to  the  cofistituents  prodycing  them  and  to  the  codes  already 
defined  for  the  glossary  entries ^  Type  A  resembles  the  code 
of  one  of  the  constituents^  indiGating  that  the  construction 
it  eodes  is  endocentriCi  For  a  highly  endocentric  construe^ 
tion,  in  which  the  syntactic  range  of  one  of  the  constituents 
is  effectively  the  same  as  that  of  the  constitutej  the  result* 
ant  code  repeats  the  code  of  that  constituent.  In  others, 
the  resultant  is  a  modified  version  of  one  of  the  constituent 
codes.  The  modified  version  will  also  be  a  glossary  code, 
indicating  that  the  combination  of  the  two  constituents  has 
shifted  the  syntactic  range  of  one  of  them  to  that  of  a  diff»> 
erent  subclass  of  its  own  class.  Occasionally,  the  Type  A 
resultant  combines  subclass  symbols  in  a  way  not  found  in 
the  glossary,  as  in  the  combining  of  a  modal  with  ^  to  pro* 
duce  the  code  9121,  but  no  new  subclass  symbols  are  intro* 
duced  and  none  of  the  previously  defined  symbols  acquire  new 
definitions. 

Type  B  does  not  resemble  the  code  of  either  constituent, 
indicating  that  the  construction  is  exocentric.  However,  the 
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Tyfe  B  resultant  coie  Is  also  a  glossary  Gdde«  This  ffieans 
thaf  Eha  synGacsic  ratige  of  cha  consclEUEe  is  £he  same  as 
GhaE  of  a  class  of  single  words  or  forms  noE  represenEed  by 
lES  conSElEuenES. 

l^pe  C  may  code  elEher  endoGenErlc  or  exocenEric  con« 
SEriiGElons  i  biiE  IE  conEalns  a  GOde  symbol  noE  previously 
used  for  any  subclass  of  glossary  words  or  forms,  or  else 
noE  previously  defined  for  a  subclass  of  Ies  own  class. 

This  indicaEes  EhaE  Ehe  synEacEic  range  of  Ehe  consEiEUEe 
is  a  new  one,  appearing  only  when  Ewo  or  more  glossary 
classes  are  combined. 

In  Ehe  following  discussion,  Type  A  and  B  resulEanEs 
are  briefly  ex^plifie4»  but  Type  C  is  presenEed  in  more 
deEail  and  the  new  code  symbols  and  new  uses  for  code  sym^ 
bois  are  lisEed  and  defined. 

Type  A  resulEanEs  are  Ehe  mosE  frequenE.  Among  oEhers, 
Ehey  appear  in  Ehe  modif ier*noim,  Ehe  verb^objecE,  Ehe  verb* 
adverb,  Ehe  auxiliary *verb,  and  Ehe  deEerminer-deEerminer 
consEiEuEe  codes.  When  simple  adjectives  are  combined  with 
nouns,  for  example,  the  noun  code  is  repeated  in  Ehe  result’' 
ant.  When  a  EransiEive  verb  is  combined  with  an  object,  the 
resulEanE  code  is  a  modified  version  of  the  verb  code,  in 
which  the  EransiEivity  has  been  reduced.  ResulEanE  codes 
for  preposiEional  phrases  and  adverbial  clauses  also  belong 
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to  this  type,  both  being  coded  as  subclasses  of  adverbs, 
whose  major  olass  membership  they  share. 

Type  B  resultants,  though  relatively  infrequent,  can 
be  exemplified  by  the  resultant  code  for  marked  infinitives 
(Infinitives  with  to)^  which  are  coded  as  fully  determined 
singular  nouns  (2xx5).  Also,  the  8w  classes  of  determiners, 
the  potential  substantives,  when  combined  with  following 
prepositional  phrases  or  postnominal  adverbs,  are  coded 
as  nouns. 

Type  c  resultants  are  fairly  frequent.  They  introduce 
one  newly  defined  code  symbol  in  second  position,  and  one 
in  third  position,  the  rest  appearing  in  fourth  position. 
Table  9  shows  them,  together  with  the  major  classes  in 
Which  they  represent  new  subclassifications,  and  defines 
their  meanings. 

AS  Shown  in  Table  9,  the  one  newly  defined  code 
Symbol  (8)  in  second  position  is  applied  to  nouns  with 
post -modifiers.  'Hie  rules  provide  that  noun  phrases  will 
be  put  together  in  one  order  only,  in  order  to  reduce  the 
nvunber  of  final  parsings.  For  instance,  in  the  happy  man 
on  the  comer,  the  head  noun  man  will  acquire  its  adjective, 
then  the  prepositional  phrase,  and  then  the  determiner. 

The  only  newly  defined  code  symbol  iri  third  position 
is  applied  to  relative  clauses.  Since  these  clauses  are 
postnominal  modifiers,  as  are  prepositional  phrases  and 
some  a’verbs,  they  are  coded  in  the  first  two  positions  as 
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fable  9 


New  Type  C  Resultant  Cede  Symbels 


Majpx’  Class 

ios . 

code 

Meaning 

Example 

Noun 

8 

A  post^modlfler 
has  been 
attached* 

man  on  the  comer 

Adverbial 

3 

4 

Postnominal 
modifier  only; 
does  not 
modify  verbs* 

which  he  wanted; 

Whom  I  know; 
that  came 

Finite 
verb  Si 

auxlllarieSi 

contractions 

4 

0 

independent 

clause* 

he  came;  let's  go; 
he  is,  he  has,  he 
does;  he  can; 
he's  here 

Finite 
forms  of  M 

4 

2 

:  Adverbial  or 
;  adjective 
complement  has 
been  attached* 

is  happy;  is  on 
the  comer 

Finite 
forms  of  ^ 

4 

3 

inversion  with 

1  there  or  heM. 

there  is  (a  man 
here);  here  are 
(the  men) 

Adjective 

4 

4 

''Determined” 

adjective. 

very  happy; 
very  happily 

Finite 

verbs, 

nouns 

4 

6 

Interrogative 
clause,  not 
inverted;  or 
interrogative 
noun  phrase, 
not  potential 
Introducer  of 
relative  clause. 

who  came; 
what  books 

Finite 

verbs, 

auxiliaries 

4 

7 

Inverted 
interrogative 
clause,  or 
Interrogative 
noun  phrase; 
may  introduce  a 
relative  clause. 

did  he  come;  Is  he; 
has  he,  did  he; 
can  he;  whose  books 
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Table  9-^Gontinued 


Major  Class 

POS. 

Code 

Meaning 

Finite 

verbs* 

auxiliaries 

4 

a  ’ 

Doubly  inverted 

interrogative 

clause. 

;  when  did  he  come; 

:  Where  is  he;  when 

1  can  he;  wnat  books 
^  does  he  want 

Any 

4 

9 

A  ceordinating 
conjunction  has 
been  added; 
requires  a 
parallel  second 
Constituent. 

the  man  and  (woman); 
i  coming  and  (going ) ; 
happy  and  (gay ) ; 
he  came  and  (they 
:  left):  can  and 
(will):  here  and 
(there) 

If  they  were  adverbial;  but  sj.nce  they  dd  not  modify  verbs> 

the  code  symbol  4,  not  previously  defined  in  third  position 

for  the  preposition -adverb-con junction  group*  is  used  to 

differentiate  them  from  single  word  postnominal  adverbs, 

prepositional  phrases*  and  adverbial  clauses  with  subordinating 

con  junctions.  Accordingly,  who  is  on  the  .corner  is  coded 

0340,^  For  example*  compare: 

the  man  who  IS  on  the  comer  the  man  waited  when  he 

was  on  the  comer 

the  nian  yonder  the  man  waited  yonder 

the  man  on  the  comer  the  man  waited  on  the 

corner 

We  quickly  admit  that  calling  relative  clauses  "adverbial" 
and  then  specifying  that  they  do  net  modify  verbs  is  an 
etymological  contradiction*  but  will  leave  It  to  the  reader 
to  invent  a  more  felicitous  terminology  if  he  wishes. 


-it  is  also  coded  as  a  noun  phrase  and  as  an  inter¬ 
rogative  clause.  See  the  section  on  j^ltlple  Resultants. 


In  fourth  position,  the  codie  symbol  0  acquires  a  new 
definition  when  in  the  eourse  of  parsing  it  appears  with 
preceding  code  symbols  for  finite  verbs  and  auxiliaries. 

In  the  glossary,  all  potential  verbs  and  auxiliaries  have 
a  1,  t,  or  3  in  fourth  position.  Mien  their  preceding 
codes  appear  with  a  0  in  fourth  position,  it  means  that 
they  have  been  combined  with  potential  subjects  and  that 
the  resultant  constitute  is  an  independent  clause  or 
Sentence,  in  effect,  this  is  to  treat  clauses  as  endocentrlc 
construetlons  with  the  verb  or  other  finite  fom  as  head. 
Similarly,  when  a  contracted  form  such  as  he ' s  is  combined 
with  an  appropriate  second  constituent  such  as  coming, 
the  resultant  will  be  a  modified  version  of  the  second 
constituent  code,  with  a  o  in  fourth  position. 

TWO  other  fourth  position  symbols  are  redefined  to 
provide  for  constructions  containing  a  finite  form  of  the 
verb  When  one  of  these  forms  is  combined  with  a 
following  adjective  or  adverbial  expression,  the  fourth 
position  of  the  resultant  code  is  a  2,  and  the  rules  do 
not  permit  the  addition  of  further  adjective  or  noun 
complement s.  When  one  of  the  forms  is  combined  with  a 
preceding  there  or  here,  the  fourth  position  code  of  the 
resultant  is  a  3#  ao  that  the  common  inversion  patterns 
there  is,  here  are,  etc,,  can  be  identified  and  correctly 
parsed  with  following  noun  subjects. 
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The  foiiPth  position  symbol  4  has  previously  been 
defined  for  nouns  in  the  glossary  to  mean  "determined 
subclass,"  NO  other  use  was  specified  for  it  in  the 
glossary  codes.  resultant  codes,  however,  It  too 
acquires  a  new  definition.  When  adjectives  are  combined 
With  a  preceding  modifier  like  very,  the  resultant  code 
Is  a  modified  adjective  code  with  a  4  in  fourth  position^ 
meaning  "detemined  adjective>"  i.e^  it  can  no  longer  be 
parsed  with  words  like  more  and  most,  ©le  meaning  Is 
very  close  to  that  for  the  noun>  but  unlike  the  nouns, 
the  adjectives  as  single  words  have  no  determined  subclass 
Only  some  adjectival  constitutes  do-^^and  therefore  we 
regard  these  resultants  as  Type  G. 

The  completely  new  fourth  position  symbols  6,  ?>  and 
$  code  Interrogative  phrases  and  clauses.  The  symbol  6 
means  "interrogative,  normal  order"  and  is  applied  to 
questions  like,  "What  books  came  yesterday?"  (1106)>  and, 
"Who  is  here?"  (9116),  and  to  noun  phrases  like,  "What 
books?"  (2026). 

The  symbol  7,  in  fourth  position,  codes  inverted 
clauses  and  appears  in  resultants  for  combinations  of 
auxiliaries  or  forms  of  ^  with  following  substantives 
that  may  be  their  subjects.  Is  the  man,  accordingly,  is 
coded  9117  and  has  the  man,  9317.  Subsequently,  If  Is 
the  man  is  combined  with  gQlngj  the  resultant  code  is 


lOl?;  arid  if  has  the  man  is  eombined  with  gone,  the 
resultant  code  is  II07,  On  the  other  hand,  the  rules 
provide  that  a  subsequent  combination,  such  as  who  with 
is  the  man  will  produce  the  resultant  code  911^,  indicating 
that  the  clause  is  a  copulative  interrogative,  not  inverted, 
and  the  combination  of  tMt,  with  is  the  man  will  produce 
the  resultant  code  9II0,  indicating  the  clause  is 
copulative,  not  Interrogative,  and  not  inverted. 

The  Symbol  7  in  fourth  position  is  also  used  for  noun 
phrases  that  may  Introduce  relative  clauses;  for  example, 
whose  books  (2027),  which  differs  in  syntactic  range  from 
vjrat  bqoks  (2026)* 

Ihe  symbol  8  in  this  position  codes  doubly  Inverted 
clauses  liice,  "Where  is  the  man?"  and,  "v&iat  books  does 
he  want?"  in  Which  an  interrogative  adverbial  or  object 
expression  precedes  the  copulative  verb  or  auxiliary,  and 
the  subject  follows  it. 

The  symbol  9  in  fourth  position  codes  combinations  in 
Which  the  first  eonstltuent  is  a  coordinating  conjunction. 
Strictly  speaking,  these  are  not  constitutes,  but  the  rules 
establish  them  as  temporary  or  pseudo  constitutes  and 
provide  that  they  may  subsequently  be  parsed  with  preceding 
parallel  expressions.  For  example,  a  singular  or  plural 
substantive  expression  combines  with  and  to  produce  the 
resultant  2xx9.  This  in  turn  may  combine  with  either  a 
singular  or  plural  substantive  expression  to  produce  a 


plmral  nouifi  phrase  eoded  txtO  (or  1x24  or  2X25i  if 
first  noiuA  belongs  to  a  detepftiAed;  subclass)* 

The  use  Of  such  teMporary  cohStitutes  can  be  extended 
to  cover  GOMparative  constructions  lllice  a  happier  man  than 
he  *  as  happy  as  he*  etc*,  as  well  as  the  elther^or,  neither 
nor  constructions^  for  which  rules  have  not  yet  been 
developed* 


MULTIPLE  RESULTANTS 

Although  combining  two  constituents  usually  reduces 
ambiguity >  in  some  instances  ambiguity  is  created  or 
increased  and  consequently  more  than  one  resultant  code 
must  be  specified  for  the  constitute*  This  is  exemplified 
by  clauses  beginning  with  whose  *  The  expressipn,  whose 
books  are  here  is  potentially  a  question,  a  postnominal 
modifier,  and  a  singular  nonn  phrase.  The  rules  for 
coding  Such  expressions  provide  three  resultants:  9106, 
0340#  and  2003.  Similarly,  who  carne  is  coded  1106,  0340» 
and  2005.  Most  of  the  multiple  resultants  appear  in  Rules 
U62.»U72*  As  with  words  like  ^  and  have,  v/hich  have  more 
than  one  code,  each  code  will  be  considered  for  possible 
combination  with  adjacent  codes  in  executing  the  parsing 
program. 


V.  THE  OUTPUT 

The  codes  and  miles  have  heen  tested,  using  a  program 
written  In  IPL^V,  a  list “processing  language.  As  was 
expected,  the  program  proved  excessively  time  consuming, 
requiring  approximately  ten  minutes  for  parsing  long 
sentences j  A  program  currently  being  developed,  using 
SCAT  and  adapted  specifically  to  these  codes  and  rules, 
has  reduced  the  time  to  a  matter  of  seconds. 

A  variety  of  sentences  was  hand  ceded>  bypassing  a 
dictionary  look-up  subroutine,  and  fed  into  the  computer 
together  with  the  rules  and  the  program.  The  computer 
parsed  them  "correctly"  (i*e*,  as  predicted).  Appendix  C 
dlsplayi  some  of  these  sentences  with  their  glossary  and 
resultant  codes  and  the  rules  which  led  to  completed 
parsings,  all  represented  in  a  tree  structure  form  clearly 
showing  the  relationships  of  the  substructures.  Except 
for  the  summary  information  on  sentence  length  and  number 
of  resultants  and  parsings.  Appendix  c  is  the  actual  output 
Obtained  automatically  from  an  IBM  7090. 

The  output  Includes  both  very  short  sentences,  designed 
to  test  the  ability  of  the  rules  to  handle  basic  sentence 
types,  and  longer  sentences,  taken  from  text  with  little 
modification,  to  test  for  the  resolution  of  ambiguities. 
Specifically,  the  problem  was  to  determine  whether  the 
codes  and  rules  as  developed  so  far  could  handle  diverse 
structures  commonly  found  in  English  and  resolve  the 


ambiguities  preserved  in  the  coding  of  homographs  sufficiently 
to  avoid  an  excessive  number  of  parsings ^  On  both  counts, 
the  test  results  appear  sufficiently  favorable  to  warrant 
contlnulni  development  of  this  approach  to  automatic 
parsings 

Multiple  parsings  still  pose  serious  problems.  In 
general,  syntactic  ambiguities  tend  to  compound  themselves 
so  that  for  each  unresolved  ambiguity  in  a  sentence,  one 
may  expect  that  the  number  of  parsings  obtained  for  that 
sentence  will  be  doubled*  That  is,  one  ambiguity  in  the 
subject  phrase  and  one  in  the  predicate  phrase  will  produce 
four  parsings,  not  two,.  On  the  other  hand,  this  also  means 
that  each  time  codes  or  rules  are  added  or  refined  to 
resolve  a  needlessly  ambiguous  construction,  the  number 
of  different  parsings  for  sentences  containing  the 
construotion  will  probably  be  halved. 

Most  of  the  multiple  parsings  appearing  for  some  of 
the  test  sentences  were  caused  by  the  presence  of 
prepositional  phrases  and  other  adverbial  modifiers.  The 
codes  and  rules  are  not  now  sufficiently  refined,  to  specify 
When  these  constitutents  should  be  attached  to  the  sentence 
as  a  whole  or  to  some  one  of  several  constituents  within 
it.  The  addition  of  rules  for  punctuation  will  help, 
although  the  probiem  cannot  be  solved  by  this  means  alone. 

More  accurate  codes  and  rules  for  prepositional  phrases 
would  greatly  reduce  the  number  of  unwarranted  parsings. 


One  immediate  step  might  well  he  the  decoding  of  the 
preposition  ^  and  the  writing  Of  special  rules  for  phrases 
in  which  it  occurs i  It  is  one  of  the  most  fred^ent  words 
in  ordinary  text;  it  usually  attaches  Its  phrase  to  an 
Iffiffiediately  preceding  noun*  and  the  classes  of  verbs  and 
adjectives  to  which  an  M  phrase  can  be  attached  are 
limited.  The  most  extreme  example  of  multiple  parsings 
in  the  test  sentences  was  sentence  15i  for  which  thirty 
parsings  were  obtained.  ®ie  number  would  be  halved  if  the 
phrase  of  text  were  attached  solely  and  unambiguously  to 
the  preceding  noun,  passages. 

As  the  codes  and  rules  now  stand,  all  sentences 
ending  with  two  or  more  prepositional  phrases  or  with  a 
prepositional  phrase  following  a  noun  object,  will  receive 
multiple  parsings,  frequently#  such  sentences  are  truly 
ambiguous,  as  in  the  test  sentence>  "1  saw  the  man  with 
the  telescope  in  the  park,"  and  all  the  parsings  are 
legitimate  interpretations.  Sometimes  they  are  not,  as 
in,  "The  boy  put  the  book  on  the  table,"  and  "The  boy  read 
a  book  on  mathematics,"  where  only  one  parsing  is  warranted. 
The  problem  of  eliminating  the  unwarranted  parsings  while 
preserving  legitimate  ambiguities  will  obviously  require 
finer  cpding  of  noun#  verb,  and  adverbial  classes. 

An  additional  ambiguity  is  present  in,  "l  saw  the  man 
with  the  telescope  in  the  park,"  and  twice  as  many  parsings 
would  have  been  obtained  for  it  if  the  additional  glossary 


GOd:e  for*  jaw  as  a  present  tense  form  of  the  vento  to  saw 
had  been  read  in.  In  a  completely  automated  processing 
of  text,  this  would  occur.  Its  elimination  here  was 
entirely  arbitrary  and  was  done  solely  to  save  needless 
processing  time  rather  than  to  minimize  the  gravity  of  the 
problem  of  homography.  fo  prevent  absurd  Interpretations 
in  w;hlch  men  may  be  dissected  with  so  blunt  an  instrument 
may  require  the  use  of  mlcroglossaries  or  the  development 
Of  a  classification  scheme  for  semantic  co-occurrence 
classes. 

Similarly i  the  homography  of  a  single  word  doubled 
the  number  Of  parsings  in  test  sentence  14,  where  the 
Sequence  code  combine s  subclass  .symbols  occurs*  In  one 
set  of  parsings^  combines  is  interpreted  as  a  plural  noun> 
modified  by  the  singular  noun  code ,  and  subclass  is 
interpreted  as  the  verb  with  symbpls  as  its  object.  In 
the  other  set*  code  Is  Interpreted  as  the  subject,  combines 
as  the  verb,  and  subclass  as  the  noun  modifier  of  the 
object  noun  symbols .  Another  unexpected  ambiguity  turned 
up  in  the  test  sentence,  "Is  the  man  master  there."  Ihe 
answer  to  the  first  parsing  presumably  could  be,  "Yes,  the 
man  Is  master  there";  to  the  second,  "No,  the  man  master 
isn't,  but  the  woman  master  is,"  These  examples  serve  to 
raise  the  question  of  the  use  of  larger  contexts  In  the 
resolution  of  ambiguities  and  suggest  one  of  the  dimensions 
in  which  the  present  limited  scope  of  parsing  is  inadequate. 


In  spite  of  these  pFobieinS^  which  would  he  difficult 
to  solve  in  Implementlr^  any  apppoach  to  automatic  parslngj 
there  are  many  advantages  to  the  approach  presented  here* 

It  preserves  legitimate  ambiguities.  It  does  not  try  to 
force  a  narrow  interpretation  of  "sentence"  upon  every 
string  of  words  appearing  between  two  end “marks  of  punctuation. 
We  do  not  need  to  speculate  or  predict  whether  or  not  there 
will  he  a  predicate  after  we  have  encountered  a  noun  phrase  * 

If  there  is  one,  and  there  is  no  subordinating  context, 
the  string  of  words  will  be  parsed  as  an  independent  clause 
or  sentence  and  its  final  resultant  code  will  mark  its 
type  as  indicative,  copulative,  anaphoric,  or  interrogative* 
or  some  combination  of  these,  and  its  order  as  normal# 

Inverted,  or  doubly  inverted*  if  it  is  not,  and  the  string 
is  simply  a  noun  phrase,  as  in  a  title,  or  Some  other  so- 
called  elliptical  construction  like  the  man  Over  there  or 
not  until  tomorrow,  the  final  resultant  code  will  so  label 
it.  Whether  the  total  structure  is  a  phrase,  clause,  or 
complex  Sentence,  this  approach  permits  its  substructures 
and  their  connections  to  be  clearly  marked  and  the  nature 
of  the  connections  to  be  clearly  indicated. 
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Appendix  A 
RULE  SYMBOLS 


Any  pdsltiidn:  *  the  code  in  chts  position  is  not  signif- 

icent  ior  the  applicetion  o£  this  tule. 

B  Copy  the  digit  in  the  corresponding 
position  of  the  first  eonstituent  code. 


Position  1: 


Position  2: 


C  Copy  the  digit  in  the  corresponding 

position  of  the  second  constituent  code. 

D  Subtract  1  before  copying  the  digit  in 
the  corresponding  position  of  the  first 


constituent  code 
M 

V  1*3. 5. 7 
N  2. 3. 6. 7 
A  4, 5*6. 7 

V  0....6 

N  0, 2,5*6, 7 
A  0*2, 3, 4, 5, 6, 7, 8 

S  0,5,6 

T  1*4,6 
F  0*1,4, 5,6 

F  3,4,5, 6 

W  1,2, 3,5 

X  2, 5, 6, 7 


Content  word  classes 

Verb 

Noun 

Adjective 

verb 

Noun,  not  genitive 

Adjective 

Simple  finite  or 
infinitive 

Fast  tense  verb 

Finite  verb 

Fast  participle 

Noun  substitute 
determiner 


Frep  sition 


Y 

3>5:,7 

Adverfe 

2 

^,6:.,  7 

SufeoE’dina  ting  c  On  June  1 1  on 

Position  3: 

V 

0>1,3,4 

Verb 

N 

0,2, 3, 4 

Noun,  not  accusative 

A 

0>4,5 

Adjective 

B 

0,4 

Simple  form 

Y 

0,1, 2, 3 

Adverb 

Position  4: 

V 

li2,3 

Verb 

N 

0,*..5 

Noun 

S 

0,1, 2, 3 

Unmodified  noun  or 
adjective 

Appendix  B 
RULES 

The  3?ailes  are>  in  gener^al^  iroiiped  according  to  the 
nuiiher  Of  the  resnltant  codes  appearing  in  the  right ^^hand 
cojurani  fhe  first  column  is  the  rule  identification  tag. 

The  second  column  contains  the  codes  of  preceding  constituents 
the  third  column  contains  the  codes  of  following  constituents j 
the  fourth  column  contains  the  resultant  codes. 


Rl 

N»*N 

VT*V 

lOOo 

8w*0 

R2 

N*4N 

VSVV 

1000 

83«0 

C 

R3 

3 

N«ON 

V*1V 

lOOo 

81*0 

R4 

N«2N 

% 

vssv 

1000 

D 

82*0 

954 1 

5 

R5 

9511 

V2SV 

1000 

4 

P 

R6 

9521 

VPSV 

lOOo 

R7 

9531 

VSSV 

lOOO 

p 

R8 

4*5N 

1**0 

GCCO 

OVOO 

91*0 

1 

2 

2 

i 

0150 

4 

6 

R9 

20*6 

1**7 

1CC8 

7 

0810 

2 

4 

5 

6 

7 

RIO 

0110 

1**7 

1CC$ 

2 

91*7 

3 

2 

4 

R15 

9101 

V2*V 

lOBb 

7 

P 

11 

7 

R16 

9121 

y/Z9y 

110  b 

7 

p 

R17 

9201 

VP*V 

IpBb 

7 

11 

7 

R18 

9221 

VP«V 

11  Ob 

7 

R19 

9301 

vssv 

lOBB 

7 

11 

1 


*61- 


R20 

9321 

7 

4  1 

7 

VSSV 

11  OB 

R25 

V*S2 

3 

N**N 

8(11*0 

1800 

R26 

V*12 

3 

32 

N**N 

8N»0 

1010 

R27 

3 

V*SV 

Oyyo 

40  5N 
0150 

IBOl 

R28 

v*iv 

3 

o 

OyYO 

405* 

015* 

6 

loll 

R29 

405N 

OYiO 

2 

V*SV 

IGOC 

R30 

4050 

OYlO 

V*1V 

3 

ICIC 

R|S 

£ 

A«SN 

NNSS 

29*0 

VPS  2 

NNSS 

2OC0 

R36 

3 

A*SN 

NNSS 

29*0 

VPS  2 

NN2S 

3 

2020 

R38 

NNSS 

0300 

2 

2880 

R39 

NN2S 

3 

3 

0300 

2 

2820 

R4q 

8000 

11 

2 

28SO 

4 

9  0 

4 

2C05 

R41 

8200 

1 

2 

2820 

4 

920 

4 

2C25 

R42 

8300 

1 

40 

1 

50 

28*0 

4 

9  0 

4 

2ee5 

R43 

8130 

28S0 

4 

5 

6 

7 

8 

9 

9 

R44 

8230 

2820 

4 

5 

6 

7 

8 

9 

R45 

8330 

28*0 

4 

5 

6 

7 

8 

9 

43 

4 

5 

6 

7 

8 

9 

R44) 

N*0S 

81*0 

VR*V 

R47 

N*2S 

3 

82*0 

vt»*v 

R48 

N*4S 

83*0 

VP*V 

R50 

8000 

NNSS 

1 

2 

10 

1 

2 

4 

R5l 

8200 

NN2S 

1 

4 

2 

3S 

4 

4S 

4 


2C04 

2C24 

2CC4 


200  & 
2025 

2045 

2005 

2025 


K52 

8300 

1 

2 

40 

1 

2 

30 

R53 

1 

2 

8300 

1 

£ 

40 

1 

2 

1 

o 

R54 

d 

81*0 

R55 

82*0 

R56 

83*0 

■C 

R57 

5 

8130 

4 

5 

6 

7 

8 

R58  8130 

4 

5 

6 

7 

8 

R59  8230 

4 

5 

6 

7 

8 

R60  8330 

4 

5 

6 

7 

8 

9 

43 

4 

5 

6 
7 
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N'^2S 

4 

3S 

4 

2023 

N'NSS 

4 

20G3 

0340 

2003 

0340 

2023 

0340 

204 

NNSS 

2004 

4 

29SS 

2904 

NN2S 

2024 

4 

3S 

4 

4$ 

4 

NNSS 

20C4 

4 

-64- 


8 

9 

53 

4 

5 

6 

7 

8 
9 

R6i  8330  NiN2S  2024 

4  4 

5  3S 

6  4 

7 

8 

9 

43 

4 

5 

6 

7 

8 

9 

53 

4 

5 

6 

7 

8 
9 

R62  9131  N**N  2004 

8W*0 

AASS 

0Y*0 

0150 

6 

VP*V 


R63 

28«0 

0340 

2005 

9 

R64 

NNSS 

0340 

2005 

R65 

MN2S 

0340 

iQ2p 

3 

R66 

81*0 

0100 

2800 

R67 

82*0 

c 

0300 

2820 

2 

R68 

83*0 

0300 

2840 

5 

2 

R09 

90*0 

vssy 

2005 

9151 

3 

5 

96»0  Nt*S  20CU 


R70 


R7l 

N*«N 

9600 

1 

2 

3 

20GC 

R7S 

0810 

2 

6 

NN$N 

20C6 

R76 

0840 

MNSN 

200  7 

R77 

0810 

2 

7 

NiN2M 

i 

2026 

R78 

0840 

NiN2N 

2027 

R79 

0810 

2 

29*iSI 

29C6 

R9d 

0060 

9 

0100 

ASAS 

2 

VI»*V 

40G4 

R91 

01 00 

470S 

4 

4705 

R92 

4050 

ASSS 

2 

VP*V 

4004 

R93 

8330 

ASSS 

2 

VP*2 

3 

4700 

R94 

8330 

A*5S 

4050 

A 

R9S 

8340 

ASAS 

2 

VP*2 

3 

o 

4800 

R96 

ASAS 

8310 

4005 

UO 

c 

8G00 

8140 

2 

8300 

U1 

8170 

8100 

823» 

eegu 

U2 

8170 

4 

8150 

8150 

03 

8100 

3 

4 

5 

6 

20 

3 

4 

5 

6 

8330 

88H0 

U9 

8300 

4 

8330 

8300 

-66- 


U5 

O 

o 

CO 

8140 

5 

6 

24 

8C1C 

U6 

8300 

4 

8350 

8120 

U7 

8370' 

8 

47 

8 

8340 

5 

6 

8370 

U8 

8500 

8260 

7 

8210 

U9 

0060 

8100 

4 

8230 

4 

aCGQ 

UlO 

OilO 

8100 

0860 

Uli 

0110 

8230 

0870 

UU 

8230 

8390 

8000 

8000 

U20 

9101 

7 

11 

7 

9131 

9180 

U2l 

9201 

9141 

9101 

5 

U22 

9211 

7 

21 

7 

31 

9141 

9180 

U23 

9207 

9141 

9107 

U24 

94*1 

7 

9151 

912y 

U25 

0040 

9131 

9CCb 

U26 

9  WO  1 

7 

11 

7 

21 

7 

401 

7 

11 

7 

21 

7 

0040 

9B8b 

U30 

9511 

N*5N 

81*0 

3 

5 

A*SN 

VP*  2 

3 

9100 

-67- 


U31 

9511 

0 1 50 

9100 

6 

0Y*0 

UBI 

N»*N 

9121 

9120 

dM«0 

2 

7 

U33 

N*SN 

9111 

9110 

81*0 

2 

3 

7 

5 

U34 

N*2N 

9101 

9100 

3 

2 

4 

7 

82*0 

3 

5 

U35 

91*1 

OYVO 

91B2 

2 

0150 

6 

A**N 


036 

91*0 

OYYO 

91BB 

6 

0150 

6 


AObN 


037 

91*1 

0150 

9183 

038 

0 1 50 

91*7 

9iGu 

6 

039 

OYOO 

91*6 

91CC 

1 

8 

2 

40  5  N 

040 

N**N 

9221 

9C20 

8M*0 

3 

4 

041 

N*SN 

9211 

9C10 

81*0 

a 

3 

J 

5 

042 

N*2n 

9201 

9000 

3 

4 

82*0 

3 

5 


050 

9101 

N*2N 

9B0  7 

2 

3 

3 

4 

9103 

82*0 

3 

5 

U51 

9111 

NSSN 

9B17 

2 

8 

«68« 

2  8 

3  9 


9113 

81*0 

3 

5 

U52 

9121 

2 

3 

4 

N»*N 

8M*0 

982  7 

U53 

91*7 

8 

0150 

6 

OVYO 

A**N 

9188 

U54 

9107 

N*2N 

3 

4 

82*0 

3 

5 

9108 

U55 

9117 

N*SN 

8l»0 

3 

9118 

U56 

9127 

5 

N**M 

8W*0 

9128 

U59 

9151 

N**N 

A*$M 

9155 

U66 

0800 

VFVV 

91*1 

7 

8 

92*1 

3 

4 

03411 

U61 

0810 

2*  -6 

1000 

91*0 

2 

3 

4 

2005 

U62 

9231 

y8*v 

034g 

2005 

U63 

080  * 

1 000 

034U 

2 

5 

2**7 

91*0 

2 

3 

/. 

2005 

U64 

0840 

H 

1000 

0340 

2005 

U65 

0810 

2046 

VFVV 

1006 

2005 

U66 

0860 

7 

2**6 

VTVV 

lOOo 

2CC5 

-69- 


U67 

0860 

VSIV 

1006 

2006 

3 

200S 

U68 

0870 

2026 

vssv 

1006 

2005 

U70 

0820 

3 

2047 

Vt'VV 

1006 

0340 

2006 

U7l 

2007 

VSIV 

3 

1006 

034u 

U72 

2027 

VSSV 

C\J\J  0 
1006 
0340 
2005 

U73 

0x*0 

9000 

N»*N 

8m*0 

0300 

U86 

0810 

9101 

91C6 

2*46 

7 

11 

7 

21 

7 

2005 

U81 

2*06 

9111 

9106 

0860 

7 

21 

7 

2005 

U82 

2*26 

9101 

91G6 

0870 

7 

21 

7 

2005 

U83 

0820 

9101 

9106 

3 

7 

0340 

4 

2*«7 

11 

7 

21 

7 

2005 

U84 

2007 

9111 

7 

21 

7 

9106 

0340 

2005 

U8S 

2027 

9101 

7 

21 

7 

9106 

0340 

2006 

U90 

0030 

0040 

0010 

y9l 

0010 

2 

3 

M*** 

8h** 

961* 

2 

01** 

A 

910* 

1 

2 

942* 

0009 

-70« 


Ut9 

VF#v 

VF»9 

IbbC 

QO 

\  *#9 

202^ 

8W*N' 

8w*9 

Ql 

VF*0 

VF*9 

1  bOu 

Q2 

V2*V 

V2*9 

I2'0l 

Q'i 

VP«V 

VP*  9 

1301 

Q4 

AVSS 

AVS9 

4ooy 

Q5 

4#S0 

4059 

40  50 

Q6 

A7*S 

A  7  *9 

4700 

Q7 

4800 

4809 

4800 

Q8 

91  #0 

91*9 

9108 

1 

2 

3 

4 

Q9 

96*0 

96*9 

9600 

Qio 

0110 

0119 

OiOu 

2 

a 

2 

Qii 

.7 

ox*o 

3 

ox  *9 

0239 

Q12 

OYYO 

0YY9 

CCGo 

0150 

0159 

6 

6 

Q13 

0340 

0349 

0340 

Q20 

N**N 

0190 

8  BOB 

A**N 

8w*N 

0Y*0 

Q2S 

0040 

NN!NN 

CGC5 

AAAA 

8W*0 

0150 

6 

0Y*0 


Q27 

0050 

H**N 

CGCG 

8W*0 

0150 

6 

0Y*0 


Q28 

NN*N 

9 

8W*0 

005* 

CGGC 

030 

0110 

1**0 

0330 

2 

91*0 

3 

2 

0Z*0 

3 

4 


Appendix  C 

SAMPLES  OF  PARSED  SENTENCES 

Most  tree^^stfUGture  representations  of  sentenGes  show 
nodes  with  branGhes  to  the  left  and  right*  aeGording  to  the 
order  in  whiGh  the  elements  appear  In  the  sentenGe.  The 
trees  appearing  on  the  following  pages*  however,  branGh 

to  the  right  and  down.  The  code  appearing  in  the  upper 
left-^hand  corner  is  to  be  considered  as  the  topmost  node. 

It  is  as  if  a  normal  tree  had  been  converted  to  its  mirror 
image  and  rotated  through  an  angle  of  45°. 

The  rules  by  whiGh  eaGh  node  was  eonstruGted  appear 
beneath  the  resultant  code.  (It  should  be  noted  that  a  0 
in  the  first  position  of  a  GOde  was  eliminated  in  the 

print •out.) 

The  number  of  different  parsings  Obtained  for  each 
sentence  is  recorded  and  for  some  sentences,  all  the  parsings 
are  printed  out.  Where  there  were  very  many  parsings, 
only  a  few  are  printed  out,  since  the  others  are  simply 
alternate  combinations  of  the  ambiguous  elements  in  the 
ones  already  displayed,  in  addition,  the  number  of  resultants 
appearing  during  the  processing  is  recorded.  Some  of  these 
resultants  do  not  survive,  but  the  number  affords  a  better 
index  to  the  length  of  time  required  for  processing  than 
does  the  length  of  the  sentence. 


-72- 


SENTENCE  1 


Sentence : 
Length : 


Let's  ti7  to  parse  this  one. 
6  words 


Numher  of  parsings:  1 

Number  of  resultants:  13 


*73- 


icce 

•  ««» 

95A1 

• 

« 

LfT*S 

ICCI 

«#»• 

3CC2 

R25 

* 

* 

TRY 

2GG5 

f 

9CCC 

R69 

A 

A 

tc 

IGCI 

««•» 

1CC2 

K25 

A 

A 

PARSi: 

815C 

817C 

U2 

A 

A 

815C 

GKE. 

THIS 

-74- 

SENfENCl  2 

Sentence i 
Length ; 

Number  of  parsings : 

Number  of  resultants: 


We  hope  that  this  will  run. 
6  words 
1 

19 


-75- 


ICCC 

R4 

«»#» 

2G2S 

ICCl 

R25 

«  *«# 

3GG2 

f-dPi 

2GG5 

1.63 

t  #4  » 

G8CC 

THAT 

icce 

R1 

«««« 

8i7e 

This 

lici 

R2C 

«44i 

9421 

hlLL 

35C2 

RLN# 

-76- 

SENfENCi  3 

Sentence : 
length : 

Nimber  of  parsings: 

Number  of  resultants: 


Some  men  were  coming  and  goings 
6  words 
1 

11 


*77- 


ICCG  *»•* 

2C25 

RA 

R52 

m 

• 

* 

• 

« 

2C20 

• 

KIN 

# 

m 

IGCI 

^ICl 

R15 

WERE 

« 

« 

12C1 

1209 

C2 

ygi 

» 

• 

« 

• 

» 

ccic 

* 

ANC 

« 

ft 

12C1 

GClNG 

8100 

SGf<E 


1201 
CCf INC 


SENfiNGl  4 


Sentence : 

Length  s 

Nuffi'ber  of  parsings: 
Niunber  of  resultants; 


Vfliy  did  he  go? 
4  words 
1 
6 


-79 


1106 

C13d 

RIG 

WHY 

» 

ft 

1107 

9327 

•••«  9321 

R2G 

C52 

CiD 

ft 

ft 

ft 

ft 

ft 

2005 

ft 

HE 

ICCl 

sc 


SENTENGE  5 


Sentenee ; 

Length : 

Nuffibef  of  parsings: 
Number  of  resultants: 


There  is  a  man  there, 
3  words 
1 
6 


911G 

•••*  ^lic 

ft44« 

0150 

y36 

L36 

THERE 

• 

i 

^  1 1 7 

««#• 

9111 

LSI 

IS 

4 

4 

2CG5 

•  •4# 

a  000 

R5C 

A 

• 

4 

li 

BCC2 

RAN 

CISC 

THERE 


SENTiNOi  6 


Sentence  : 
Lengtli ; 


Is  the  man  there 
4  words 


Number  of  parsings :  1 

Number  of  resultants;  4 


9118  9117  •«««  9111 

UI3  151  IS 

•  • 

*  « 

*  2C0A  ••••  8470 

*  R6d  tht 


ICC2 

PkH 


0150 

ThtRE 


SENfENGE  7 

Sentence ; 

Length : 

Number  of  parsings: 

Number  of  resultants: 


IS  there  a  man  there 

5  words 
1 

6 


9111 

is 


9ii8 

9117 

«•••  9113 

U53 

LSI 

U37 

# 

i 

• 

• 

• 

* 

« 

0156 

« 

« 

THERE 

« 

« 

» 

m 

2CC5 

•••«  8000 

* 

R5G 

A 

# 

# 

« 

• 

SCC2 

» 

CISC 

THERE 


SENTENCE  8 


Sentence:  The  man  is  happy  there 

liength :  5  words 

Number  of  parsings:  g 

Niimber  Of  resultants:  ll 


i 


9  lie  *•## 

2CG^  ***♦ 

8470 

uil 

R66 

THE 

• 

• 

» 

• 

• 

3CC2 

» 

RAN 

# 

• 

9112  •••• 

9112 

9111 

y35 

U3S 

IS 

# 

• 

« 

• 

• 

9CC0 

» 

HAPPY 

« 

» 

01  SO 

THlRi 

-88- 

gilC  •••• 

Sllc 

2004  **•# 

8470  f 

U36 

V33 

R6C 

THE  ^ 

0 

4 

• 

i 

1 

4 

• 

» 

i 

« 

3002 

i 

# 

• 

PAN 

- 

« 

« 

^112 

91U 

4 

L35 

is 

4 

m 

m 

m 

4C0C 

4 

HAPPY 

CISC 

ThERE 

f 

i 


Sentence ; 
Length i 
Number  of 
Number  of 


SENTENCE  9 

Is  the  man  happy  there 
5  words 

parsings :  1 

resultants:  4 


-90« 


9118  •«**  9118  ••••  9117  9111 

U53  L53  U51  IS 

•  •  • 

*  •  * 

•  •  2009  *•••  8970 

*  •  R60  the 


3002 

MAN 


9000 

HAPPY 


disc 

THlRi 


-91“ 

SENfENCE  10 

Sentence:  Is  the  man  master  there? 

Length :  5  words 

Number  of  parsings:  'k 

Number  of  resultants:  11 


\ 

\ 

I 


-92 

9118 

5117 

9111 

Uft 

USl 

IS 

• 

• 

2GCA 

8470 

R60 

THE 

i 

« 

2CC0 

3002 

R3§ 

MAN 

« 

1 

3GC)2 

• 

PASTER 

0150 
THE  R  E 


-93- 


9118 

U53 


«•••  9118  •••• 

155 


9117 

9111 

U51 

is 

• 

• 

2009 

8970 

R60 

THE 

• 

• 

3002 

MAN 

3CC2 

HAStER 


0150 

There 


Sentence : 
Lengtfe: 
Number  of 
Number  of 


-94* 

SENfENGl  11 

VtiO  is  in  charge  of  production  there? 
7  words 

parsings :  2 

resultants:  2l 


9116  ♦** 

U36 


OlSC 

rwiRE 


9U6 

•  #«» 

9tU 

136 

U83 

« 

• 

n 

• 

« 

9111 

IS 

f 

« 

G36di 

•  f  »  '• 

0530 

173 

IN 

# 

2800 

•  #«« 

3002 

R38 

CHAR re 

i 

« 

CBCC 

0230 

L73 

OF 

i 

• 

2CCC 

PRGCLCt ION 

0830 

MHO 


9116  *« 

L36 


015C 

THiRE 


9116 

9116  »•«* 

9ll6 

136 

U36 

08  3 

# 

* 

# 

• 

# 

9111 

• 

is 

« 

« 

0300 

05  30 

U73 

IN 

• 

• 

3002 

CHAftrg 

0300 

0230 

L73 

GF 

• 

2000 

PRGGLCr ICK 

•*  0830 

WHO 


Sentence : 
Length : 
Ntimber  of 
Number  of 


SENTENCE  12 

The  boy  put  the  book  on  the  table 
3  words 

parsings :  2 

resultants:  27 


-98- 


ICGC 

«««« 

2CC4 

ftftftft 

8A70 

R1 

R6e 

TH:fe 

• 

ft 

ft 

ft 

ft 

2CGC 

ft 

BCY 

ft 

ft 

ftftftft 

i6Gl 

ftftftft 

16C2 

R27 

R2S 

PUT 

ft 

ft 

ft 

ft 

ft 

2eC4 

ftftftft 

8A70 

ft 

R6C 

THE 

ft 

ft 

ft 

ft 

ft 

3CG2 

ft 

eccK 

ft 

ft 

OICC 

ftftftft 

C53C 

U73 

CK 

ft 

ft 

HC^ 

*«*• 

8A7C 

RfeC 

THt 

3CC2 

tabled 


iCGC 

«••• 

2GC4 

Rl 

R60 

» 

ft 

i 

ft 

» 

2CGG 

« 

« 

BCY 

l6Ct 

ftftftft 

UG2 

R2§ 

• 

* 

PUT 

2eC4 

«••• 

8470 

R^5 

« 

• 

THE 

28CC 

ftftftft 

3GG2 

Rig 

» 

t 

ecCK 

03  GC 

ftftftft 

C530 

U7B 

ft 

ft 

GN 

2GG4 

*«ftft 

8470 

RfeG 

« 

• 

3CC2 

TA8LI. 

THE 

84  70 
THI 


-100« 

SENfENCE  13 

Setilenee : 

Length : 

Nuniher  of  parsings: 

Number  of  resultants: 


X  saw  the  man  with  the  teieseope  in 
the  park^ 

10  words 
4 

37 


-101 


IGGC 

«#•» 

2C25 

R1 

« 

• 

I 

iiGi 

ft  i«« 

1102 

R25 

« 

« 

SAW 

28CA 

««•« 

8A7C 

«4  5 
• 

ii 

The 

2eCG 

«««• 

3C02 

R38 

• 

• 

PAN 

OIGG 

ft  •«« 

C230 

U13 

» 

• 

WITH 

2ec<i 

ftftflft 

8470 

R^5 

• 

• 

THE 

2800 

««#  ft 

3CC2 

Rie 

• 

• 

TELeSGOPl 

0300 

ftftftft 

CS30 

U73 

• 

* 

IN 

2CC<i 

ft  ftftft 

8470 

R4C 

• 

t 

3G02 

PURK 

THE 

-102 


IGGG 

ftftftft 

2G25 

Kl 

I 

ft 

ft 

lici 

ftftftft 

llCl  *•** 

lie? 

R27 

R26 

SAW 

ft 

• 

ft 

« 

ft 

2CG<t 

8A70 

ft 

K6C 

THE 

ft 

• 

ft 

« 

ft 

3GC2 

ft 

RAN 

ft 

ft 

GBCC 

ftftftft 

C23G 

U73 

WITH 

ft 

ft 

2ICA 

ftftftft 

eA7G 

RA% 

TNE 

ft 

ft 

2€CC 

ftftftft 

3CC2 

Rl8 

tiLf SGCPI 

ft 

ft 

cicc 

ftftftft 

C53C 

U73 

IN 

ft 

ft 

2CC<t 

ftftftft 

e^Tc 

MG 

THE 

3CC2 

P«RK 


ICCC 

2C2b 

R1 

I 

» 

• 

llCl 

lici 

«ftft» 

1102 

Ii27 

R25 

SAh 

« 

# 

2  EC  A 

ftftftft 

84  70 

RA5 

THE 

m 

m 

2  sec 

3002 

R38 

HAN 

« 

1 

C3GC 

0230 

L73 

WITH 

f 

« 

2CCA 

#ftftft 

8A70 

R4C 

The 

# 

ft 

3CC2 

TELESeOPE 

OICC  •••#  C53C 
U 7  3  IN 

t 

• 

2CC^  «•**  8A7e 
ReC  THE 

• 

• 

3eC2 

PARK 


-104- 


IGCG 

2C2S 

Kl 

I 

# 

« 

lici 

«««• 

1101 

*««• 

ilot 

•••*  1102 

^2  7 

R27 

R25 

SAW 

« 

ft 

• 

ft 

# 

« 

ft 

2004 

8470 

# 

ft 

R60 

THE 

• 

ft 

• 

« 

ft 

• 

« 

ft 

3002 

# 

ft 

HAN 

• 

ft 

• 

ft' 

ft 

G  300 

ftftft* 

0230 

ft 

L73 

WITH 

ft 

ft 

ft 

ft 

ft 

2CCA 

#«#• 

84  70 

ft 

R6G 

TH4 

ft 

ft 

ft 

ft 

ft 

3002 

ft 

ft 

TELHSeCFfe 

ft 

03CG 

C53C 

U73 

ft 

IN 

ft 

2CCA 

8<»7C 

K^G 

THE 

« 


3CC2 

P4RK 


SlNfENCE  14 


Sentence : 


Length : 

Number  Cf  parsings: 
Number  of  resultants: 


Occasionally  the  modified  code 
combines  subclass  symbols  in  a 
new  way  but  no  new  undefined  symbols 
are  introduced ^ 

18  words 

10*  (Five  parsings  only  shown, ) 

249 


*Seven  of  the  ten  parsings  are  due  to  coding  the 
Word  combines  3032 . 
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• 
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• 
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• 
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« 
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» 
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• 

« 
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» 

2820 
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3002 
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• 

« 

4 

# 
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2020 

• 

« 

R39 
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» 

• 

« 

f 

« 
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0300 
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IN 

• 
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• 
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• 
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• 
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2 £20 
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4000 
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H 
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Ri5 
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f 
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14C2 
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ICGO 

4C5G 

Rg 
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• 
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8470 
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R3 
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# 

ft 

• 

• 

ft 

• 

i 

ft 

2000 

1402 

# 

ft 

R35 

KQOIFIEC 

« 

ft: 

• 

• 

ft: 

« 

« 

ft. 

3002 

« 

ft 
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« 

ft 

« 

ft 

ft 

1011 

•  ••ft 

3032 

ft 

R26 
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ft 
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ft 

ft 

ft 

2820 

•••ft 

2020  •••• 

3002 

ft 

R39 

R36 

SUBCLASS 

ft 

ft 

• 

ft 

ft 

* 

ft 

ft 

2G20 

ft 

ft 
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ft 

ft 

ft 

f 

ft 
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0530 

ft 
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IN 

i 

ft 

ft 

ft 

• 

f 

20C^ 

•  ••ft 

8  GOO 

ft 

R50 

A 

ft 

ft 

ft 

ft 

ft 

20CO 

ftftftft 

4000 

ft 

R35 

NEW 

ft 

ft 

♦ 

ft 

ft 

ft 

20C0 

ft 

ft 

ft 

CC3C 

* 
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ICQO  •••« 

2C25 
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R4 

R52 

NC 
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40CO 
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R36 
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« 

• 

t 

• 

* 
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4CC0 

« 

Rl6 
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t 

• 

« 

• 

# 

2G2Q 

« 

SYf»eCLS 

# 

* 

ICGl 

51CI 

Rl5 

ARE 

t 


U02 
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ICCO 

4C5C 

Re 

GCeASIQNALLY 

• 

• 

iCGG 

«««# 

ICG9 

loeo 

44  44 

2004 

4444 

8470 

Qi 

tJ9l 

R3 

R60 

THE 

• 

4 

* 

# 

4 

• 

« 

4 

2CC0 

4444 

1402 

4 

R35 

RQOiFlie 

* 

4 

• 

« 

4 

• 

4 

4 

3G02 

4 

4 

CGCE 

4 

4 

4 

4 

4 

1011 

4444 

icii 

*4*4 

3032 

4 

R23 

R26 

GOt»'BiNf  S 

4 

4 

• 

4 

4 

• 

4 

4 

2C20 

*444 

3002 

4 

4 

R  36 

SUBCLASS 

4 

4 

• 

4 

4 

• 

4 

4 

2G20 

4 

4 

SYRBGiS 

4 

4 

4 

# 

4 

C300 

4444 

0530 

4 

U73 

IN 

4 

4 

4 

4 

4 
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4 
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4 
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4 

4 

• 
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4  COO 
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4 

4 

4 

4 

4 
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4 

4 
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4 
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ft 

ft 
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•' 

ft 
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ft 

ft 
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« 

« 

« 

• 

ft 

« 

ft 

ff 

ft 

fl 

ft 
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ft 

i 

ft 

fl 

ft 
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ft 

ft 

« 

ft 

« 

ft 

« 

ft 

f 
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« 
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« 

ft 

« 

ft 

« 
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« 
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f 
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# 

ft 

• 
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f 
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« 

ft 

« 

ft 

« 
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« 

ft 

f 

ft 

« 
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« 
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f 

ft 
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ft 

« 
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f 

« 
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t 
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R4 
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« 

• 
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• 
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• 

• 
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• 
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# 

« 

2C2C 
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« 

• 
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• 
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G1 
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« 

ft 

« 
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ft 

« 
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« 

ft 

ft 
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ft 

# 

« 

ft 
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1402 
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ft 

ft 
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# 

ft 

ft 

ft 

« 

ft 

ft 

ft 

• 

ft 

ft 

3002 

m 

ft 

ft 
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• 

ft 

ft 
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ft 

ft 

i 

ft 
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• 

ft 
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ft 

ft 

« 

ft 

ft 

« 

ft 
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« 

ft 
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# 

ft 

ft 

ft 
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ft 

ft 

f 

ft 
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ft 

ft 
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ft 

# 

ft 

* 
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ftftftft 
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» 
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IN 

# 

ft 

ft 

« 

ft 
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ft 
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A 

t 

ft 

ft 

• 

ft 

ft 
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ft 
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* 
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f 

ft 

ft 

ft 
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« 

ft 
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t 

• 

ft 

ft 
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t 

f 

ft 
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« 

• 

ELT 
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R15  ARE 
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Sentence : 

Length: 
NiMbe?  of 
Number  of 


SENfENCE  15 

This  paper  die  scribes  a  method  for 
discriminating  between  passages  of 
text  received  from  different  souroes 
15  words 

parsings :  30  (Pour  parsings  only  shown^ ) 

resultants:  264 
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iccc  2ec<i  *•••  ai7e 

r!  RS7  This 

«  • 

•  • 

*  3CC2 

•  PAPIB 


icii  •«*#  IGll  1011 
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101 t 
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R26 
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i 

M 

• 

2005 
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R50 

A 

• 

• 
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METMQD 

0300 

0630 

U73 

FOR 

3201 

discriminating 


03CO  *•••  0230 
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• 

• 

2C20 
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C3CC  0230 
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« 

f 
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• 

t 
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RECEIVEC 


0300 

••••  C230 

U73 
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« 

9 

2C2C 
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R3e 

CIFFERENT 

2C2C 
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ICCC 

ftftftft 

2GCA 

ftftftft 

8170 

R3 

THIS 

ft 

ft 

3CC2 

P4PfiR 

1C  11 

ftftft* 

leii 

ftftftft 
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lOll 
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K28 
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ft 

ft 
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ft 

ft 

ft 

• 

• 

ft 

ft 

• 
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ft 

ft 

• 
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ft 

ft 

# 
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ft 
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• 

ft 

ft 

ft 

• 
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ft 

ft 

• 
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ft 
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• 

ft 

ft 

• 

ft 

ft 
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ft 

ft 
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ft 
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• 
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* 

ft 

ft 
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ft 

ft 
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ft 

ft 

ft 

ft 
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ft 
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ft 

ft 

ft 

ft 
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ft 

ft 

ft 

ft 

ft 
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ftftftft 
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ft 
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CF 

ft 

ft 

ft 

ft 

ft 

2CC5 

ftftftft 
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ft 

K4i6 

TEXT 

ft 

ft 

ft 

ft. 

ft 

14C2 

ft 

ft 

RECEIVEL 

ft 

C3CC 

ftftftft 

e23C 

U73 

ERCR 

ft 

ft 
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IGGG 
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ft 
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0300 
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ft 
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ft 

ft 
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ft 

ft 
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ft 
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112 
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ft 

ft 
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ft 
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ft 
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OF 
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ft 

ft 
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ft 
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ft 

ft 
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R2 
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ft 
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ft 
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ft 

icii 

ftftftft 
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ft 
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A 

ft 

ft 
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2GCO 
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ft 

ft 
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ft 

ft 
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ft 
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ft 
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Sentefiee : 


Length : 
Number  of 
Number  of 


SENTENCi  16 

These  blocks  of  digits  subclassify 
the  dietermlners  very  roughly  since 
the  ordier-class  patterning  of  these 
words  appears  to  be  too  intricate 
for  the  construction  of  any  simple 
or  elegant  Scheme  ^ 

30  words 
parsings :  5 

resultants:  368 
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ft 
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yisi 

ff 

UbO 

eb 

« 

ft 

ft 

ft 

ft 

ftftftft 

0090 

R9G 

fCG 

ft 

ft 

ft 

ft 

ft 

ACCO 

ft 

INTRICATC 

ft 

C3CC  ft#«ft 

C6  30 

in 

FCR 

ft 

ft 

2CC4  *«ftft 

8470 
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ft 
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ft 
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ft 

ft 
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OR 


4C0O 
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SOI^ef'E 
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• 

2820  ••••  3032 

R3^  BLOCKS 
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R6i 
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ft 
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ft 

CiTERHINERS 

ft 

ft 

4054 

ftft  ftft 

0060 

R90 

VERY 

ft 

ft 

4050 
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« 

ft 

t 
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ft 
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4 
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0230 

iJ73 

OF 

# 

# 

2G20 

ClGItS 

icci 

«•«# 

lOCl 

ftftftft 

ICOl  *•** 
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• 
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ft 

ft 
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CRCER-eLASS 

ft 

« 

ft 

f 

ft 
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ft 
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9 

ft 
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ft 

ft 
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• 
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« 
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